The 5-Electron case of Thomson's Problem 

Richard Evan Schwartz * 
February 8, 2010 

Abstract 

We give a rigorous, computer-assisted proof that the triangular 
bi-pyramid is the unique configuration of 5 points on the sphere that 
globally minimizes the Coulomb (1/r) potential. We also prove the 
same result for the (1/r 2 ) potential. The main mathematical contri- 
bution of the paper is a fairly efficient energy estimate that works for 
any number of points and any power law potential. 

1 Introduction 

1.1 Background 

The problem of finding how electrons optimally distribute themselves on the 
sphere is a well-known and difficult one. It is known as Thomson's problem, 
and dates from J. J. Thomson's 1904 publication [T]. Thomson's problem is 
of interest not just to mathematicians, but also to physicists and chemists. 

Here is a mathematical formulation. Let S 2 C -R 3 be the unit sphere. 
Let P be a collection of n distinct point pi, ...,p n G S 2 . Let E : R + — > R + 
be some function, the total energy to be the sum 

E(P) = Y ; E(\\ Pi -p j \\). (1) 

i>j 

Here || • || is the usual norm on i? 3 , and \\pi —pj\\ is the distance from p$ to pj. 
The question is then: What does P look like if S(P) is as small as possible? 
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The question is perhaps too broad as stated, because the answer likely 
depends on the function E. To consider a narrower question, one restricts 
the class of functions in some way. For instance, a natural class of potential 
functions is given by the power laws 

£(r) = i; ee(0,oo). (2) 

The case e = 1 is specially interesting to physicists. It is known as the 
Coulomb potential. When energy is measured with respect to the Coulomb 
potential, the points are naturally considered to be electrons. 

There is a large literature on Thomson's problem. One early work on 
Thomson's problem is [C]. The paper [SK] gives a nice survey in the two 
dimensional case, with an emphasis on the case when n is large. The paper 
[BBCGKS] gives a survey of results, both theoretical and experimental, 
about highly symmetric configurations in higher dimensions. The fairly re- 
cent paper [RZS] has some theoretical bounds for the logarithmic potential, 
and also has a large amount of experimental information about configura- 
tions minimizing the power laws on the 2-sphere. The website [CCD] has 
a list of experimentally determined (candidate) minimizers for the Coulomb 
potential for n = 2, 972. 

There are certain values of n where the minimal configuration is rigorously 
known for all the power laws. 

• When n — 2, the points of P are antipodal. 

• When n — 3, the points of P make an equilateral triangle in an equator. 

• When n — 4, the points of P are the vertices of a regular tetrahedron. 

• When n — 6, the points of P are the vertices of a regular octahedron. 

• When n = 12, the points of P are the vertices of a regular icosahedron. 

The cases n = 2 is trivial to prove, and the case n = 3 is an easy exercise. 
The cases n = 4,6, 12 are all covered in [CK, Theorem 1.2], a much broader 
result concerning a fairly general kind of energy potential and a class of point 
configurations (not restricted to 2 dimensions) called sharp configurations. 
For the specific cases cases n = 4, 6 see the older work [Y]. 

The case n = 5 is conspicuously absent from the above list of known 
results. Everyone agrees, based on numerical simulation, that the triangular 
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bi-pyramid (TBP) seems to be the global minimizer for the Coulomb poten- 
tial. In the TBP, two points are antipodal points on S 2 and the remaining 
3 points form an equilateral triangle on the equator midway between the 
two antipodal points. More generally, numerical experiments suggest the 
following. 

• The TBP is a local minimizer for the power law potential with exponent 
e if and only if e G (0, d), with e x « 21.147123. 

• The TBP is a global minimizer for the power law potential with expo- 
nent e if and only if e 6 (0, e 2 ), with e 2 ~ 15.040808. 

For e > e2, it seems that the global minimizer is a pryamid with square base. 
The precise configuration depends on e in this range. 

In spite of detailed experimental knowledge about the case n = 5, it 
seems there has not ever been a proof that the TBP is global minimum for 
any power law potential. In particular, this has not been proved for the 
Coulomb potential. As far as we know, there are two rigorous results for the 
case n — 5. 

• The paper [DLT] contains a (traditional) proof that the TBP maxi- 
mizes the geometric mean of the pairwise distances between the points. 
This case corresponds to the logarithmic potential E(r) = — log(r). 

• The paper [HS] contains a computer-aided proof that the TBP maxi- 
mizes the potential for the exponent e = —1. That is, 5 points on the 
sphere arrange themselves into a TBP so as to maximize the total sum 
of the pairwise distances. 

1.2 Results 

It is the purpose of this paper to prove the following result. 

Theorem 1.1 The TBP is the unique configuration of minimal energy with 
respect to the Coulomb potential. 

Our proof is computer aided, and similar in spirit to [HS]. While the 
argument in [HS] is exactly tailored to understanding the sums of the dis- 
tances, our method is rather insensitive to the precise power law being used. 
Just to illustrate this fact, we prove 

1 Lacking a handy reference for these experiments, we performed our own. 
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Theorem 1.2 The TBP is the unique configuration of minimal energy with 
respect to the 1/r 2 potential. 

We could certainly add other exponents to our list of results. However, 
we currently have implemented the interval arithmetic in such a way that 
it works just for the exponents e = 1,2. We did this to avoid using the 
pow function pow(a, b) = a b , which is not covered by the same guarantees in 
the IEEE standards clS cLFG the basic arithmetic operations. See [I] and [12]. 
Mainly what stops us is a sense of diminishing returns. 

However, with a view towards an eventually broader application, we try 
to state as many estimates as we can for the general power law. In every 
situation, we try to point out the exact generality with which the construction 
holds. For example, our main technical result, Theorem I5.1[ holds for any 
function E satisfies the conditions discussed in §5.21 Also, the version of 
Theorem 15.11 we prove works for general n-point configurations. 

1.3 Outline of the Proof 

It has probably been clear for a long time that one could prove a result like 
Theorem 11.11 using a computer program. The main difficulties are technical 
rather than conceptual. The hard part is getting estimates that are sharp 
enough so as to lead to a feasible calculation. Our paper doesn't really have 
any dramatic new ideas. It just organizes things well enough to get the job 
done. 

We begin by eliminating some obviously bad configurations from consider- 
ation. For example we show that no two points in a minimizing configuration 
lie within 1/2 units of each other. This result holds for any power law po- 
tential. See Lemma 12.11 After eliminating these bad configurations, we are 
left with a compact configuration space Q of possible minimizers. 

We use stereographic projection to transfer the configurations on S 2 to 
configurations in C U oo. By keeping one point at oo and another on the 
positive real axis, we end up with a natural description of Q as the set 
[0, 4] x [—2, 2] 6 C R 7 . Working in f2, we define a natural way to subdivide a 
rectangular solid subset of Q into smaller rectangular solids. 

We use a divide-and-conquer algorithm to show that any minimizing con- 
figuration in Q must lie very close to the TBP. Assuming that Q is a rect- 
angular solid that does not lie too close to the point (s) of Q representing the 
TBP, we try to eliminate Q with a 3-step procedure. 
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• We eliminate Q if we can see, based on the fact that the regular tetra- 
hedron minimizes energy for 4 points, that no configuration in Q can 
minimize energy. This is described in §3.41 This method of eliminating 
Q cuts away a lot of the junk, so to speak, and focuses our attention 
on the configurations where are fairly near the TBP. 

• We eliminate Q if the configurations in it are redundant, in the sense 
that some permutation of the vertices or obvious application of symme- 
try changes these configurations to ones in a form that we deem more 
standard. See §U 

• If the first two methods fail, we evaluate the energy £ on the vertices 
of Q and then apply an a priori estimate on how far S differs from a 
linear function on Q. Our main result along these lines is Theorem 15. II 
This is our most powerful and general method of elimination, but we 
only use it when all else fails. 

If it is not possible to eliminate Q by any of our methods, we subdivide Q 
into smaller rectangular solids and try again on each piece. This algorithm 
is discussed, in outline, in §2.51 In the chapters following §2.51 we fill in the 
details. Theorem 15. II is the most subtle and important part of the paper. We 
prove this result in §5-8. 

The end result of our finite calculation is that the true minimizer of Q 
must lie very close to the TBP. See Lemma 12.31 To finish the proof, we 
use calculus to show that the TPB can be the only local minimum in the 
region where we have confined the minimizer. We do this by showing that 
the Hessian of £, the matrix of second partials, is positive definite throughout 
the region of interest to us. See Lemma 12.41 

1.4 Computational Issues 

We implement our computer program in Java, using interval arithmetic to 
control the round-off errors. We will discuss this in JLJJJ Our code only uses 
the basic operations plus, minus, times, divide, and sqrt, and these operations 
are performed in such a way as to avoid and arithmetic errors (such as taking 
the square root of a negative number.) It takes about 6 hours for our program 
to eliminate all configurations that are not within 2 -14 , in the L°° norm, of 
a suitably normalized version of the TBP. See Lemma 12.31 Without the 
interval arithmetic, the code runs in about an hour. 
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Theorem 15.11 is the rate-determining step in our calculations. Numerical 
experimentation suggests that our bound in Theorem 15.11 is off by a factor 
somewhere between 2 and 4. In light of this fact, our calculation probably 
ought to be about 10 times faster than it is. We can certainly improve 
Theorem 15. 1[ but we haven't been been able to think of a simple or dramatic 
improvement. 

A nice feature of our program is that we have embedded it in a graphical 
user interface. The reader can watch the program in action and see how it 
samples the configuration space, the reader can also manually construct a 
rectangular solid subset of the configuration and then see a printout of all 
the computational tests that are applied to it. This graphical aspect doesn't 
add anything to the formal proof, but it makes it less likely we have made a 
gross computing error. We have tried to isolate the relatively small amount 
of computer code that goes into the actual proof, so that it can be more 
easily inspected. 

The entire Java program is available from my website. See 
http://www.math.brown.edu/~res/Electron/index.html. The code 
is fairly well documented, and we're still working to improve the documen- 
tation. Aside from graphical support files, all the files involved in the proof 
have the Interval prefix. The directory contains a number of other files, 
which support other features of the program. Even though the proof portion 
of the code is done and working, the whole program is still somewhat in flux. 
I plan to gradually improve the code as time passes, and update it as I go. 
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2 Proof in Broad Strokes 



2.1 Stereographic Projection 

We find it convenient to work mainly with CUoo rather than on S 2 . Our 
reason for this is that a configuration space based on points in C has a 
natural flat structure, and lends itself well to a nice subdivision scheme. The 
subdivision scheme, which essentially amounts to cutting rectangular solids 
into smaller rectangular solids, feeds into our divide-and-conquer algorithm. 
All this is discussed in §2.51 below. 

We map S 2 to C U oo using stereographic projection: 



E(x,y,z) 



x 



1-z 



+ i 



y 



1 - z 



(3) 



£ is a conformal diffeomorphism which maps circles on S 2 to generalized 
circles in C U oo. A generalized circle is either a circle or a straight line. We 
have 

£(0,0,l) = oo. (4) 

Thus, £ maps a circle C C S 2 to a straight line if and only if (0, 0, 1) G C. 
The inverse map is given by 



^-\x + iy)= (- 



2x 



2y 



+ x 2 + y 2 1 + x 2 + y 2 



1 



ar 



(5) 



Let || ■ || denote the usual norm on R 3 . Here are two pieces of metric 
information we will use later on: 



oo 



(6) 
(7) 

Equations Eland [7] both have straightforward derivations, which we omit. 
Slightly abusing notation, we define 



cE- 1 




d^- 1 


2 


dx 




dy 


1 + x 2 + y 2 



E( Zl ,z 2 ) = E{T l -\z 1 ),T l - L {z 2 )) 



(8) 



Here E can be any energy potential, for points zi, z-i G C U oo. In this way, 
we can talk about the energy of a configuration of points in C U oo. 
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2.2 The Triangular Bi-Pyramid 

As in the introduction || - || denotes the usual norm on R 3 . Unless stated 
otherwise, all the distances we measure in i? 3 are taken with respect to the 
Euclidean metric. What we say here works for any energy potential. 
In the TBP, we have the following information. 

• One pair of points is 2 units apart. 

• 3 pairs of points are \/3 units apart. 

• 6 pairs of points are \/z units apart. 
Accordingly, the energy of the TBP, with respect to E, is 

M E = E(l/2) + 3E(V3) + 6E(y/2). (9) 

It is well known that the regular tetrahedron minimizes the energy for 
4 points. All points are a/8/3 units apart. For this reason, the regular 
tetrahedron has energy 

T E = 6E(y/8/3). (10) 

We can use this fact to get some crude bounds on 5-point configurations. 
Given a 5 point configuration P and a point p G P, we define 

e(p, p )= ^(Ib-sll)- (ii) 

q£P{p} 

We have the immediate estimate 

E{P) >E(P,p) + T E . (12) 

Thus, if P is an energy minimizer we must have 

E(P,p)<M E -T E ; VpeP (13) 

This is one of the criteria we will use to eliminate certain configurations from 
consideration. 

We can also use Equation [13] to give information about pairs of points 
within a minimizing configuration. We will consider this in the next section. 
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2.3 Estimates for the Power Laws 

The fairly weak results in this section are designed to give us some control on 
the size of the configuration space we must consider. For any given exponent, 
these results are just short calculations. It is only our desire to handle all 
exponents at the same time that adds complexity to the proof. 

Lemma 2.1 Let p, q%, q2 be 3 points of an energy minimizer with respect to 
a power law potential. Then ||p — > 1/2. 

Proof: Let E(r) = l/r e . Since E is decreasing, E(\\p — r||) > E(2) for all 
p,r G S 2 , because S 2 has chordal diameter 2. In light of Equation [121 this 
lemma is true provided that 

T E + 3£(2) + £(1/2) - M E > (14) 

When E is as above, our problem boils down to showing that 

0(e) = 2 l - e - 3 1 - 6 / 2 + 2 e - Z 1 -^ 2 + 2 1 -( 3e )/ 2 3 1+e/2 > 0. (15) 

This is an exercise in calculus. We compute 

0(0) = 0; 0'(O) > 0; 0"(O) > 1. (16) 

We compute 0"'(e) — A(e) — B(e), where 

A(e) = 31og(2) 2 2- 2 - £ / 2 + 2 e log(2) 3 + M^ 1 - 6 / 2 > 0; 

8 

B(e) = 2 l - e log(2) 3 + 2 - 2 -( 3e )/ 2 3 1+e/2 log(8/3) 3 < 2. (17) 
In short, 0"'(e) > —2. Taylor's theorem with remainder now tells us that 

e 2 p 3 

0(e) > y - 3 ■ (18) 
Hence 0(e) > for e G (0, 3/2). A similar computation for 0' shows that 

0'(e) > -10. (19) 

We compute that 

0(3/2 + j/1O)>1; j = 0,...,85. (20) 

Combining the last two equations, we see that 0(e) > for all e G [3/2, 10]. 
Finally, for e > 10, the result is obvious. 4 
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Lemma 2.2 Let p,q\,q2 be 3 points of an energy minimizer with respect 
to a power law potential. Assume that E{\\p — g 2 ||) < E(\\p — qi\\). Then 
\\p - gi|| > 1/2 and \\p — qr 2 1| > I. 

Proof: We have the inequality 

E(\\p - ^11) + E(\\p - q2 \\)<M E -T E - 2E(2). (21) 

Hence 

E(\\ P -<A)< ME - TE 2 - 2m . (22) 
Establishing this inequality boils down to showing that 

0(e) = 2 + 2 1 - 6 - (3)(2 1 - e/2 ) - 2- e - ^- e/2 + 2 1 -( 3e )/ 2 3 1+e/2 > 0. (23) 

This time we compute 

0(0) = 0; 0'(O)>O; 0"(O) > 1/4. (24) 

Examining the terms of 0'", as in the previous lemma, we find that 

0"'(e) > -12. (25) 

Taylor's theorem now tells us that > on [0, 1/16). A similar computation 
shows 

0'(e) > -10. (26) 

We now compute that 

0(1/16 + j/2000) > 1/200; j = 0, 1875. (27) 

Combining the last two equations, we see that 0(e) > for e G [1/16,1]. 
Next, we compute that 

0(1 +j/100)> 1/10; j= 0,...,900. (28) 

This shows that 0(e) > for e G [1,10]. For e > 10 the result is again 
obvious. 
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2.4 Planar Configurations 



We took the trouble to prove Lemmas 12.11 and 12.21 for any power law so that 
what we say in this section works for any power law. 

Basic Definition: Let P = {po,...,P4} be a configuration of 5 points on 
S 2 . Z = S(P) = {z , ...,z 4 } be the corresponding configuration in C U oo. 
We rotate S 2 so that 

Z4 = oo; zo € R+ E(zo,Zi) = max E(zi, Zj). (29) 

i<j 

The TBP: Now we discuss what the TBP looks like with this normalization. 
The TBP has two kinds of points. We say that the polar points are the two 
antipodal points in the configuration. We say that the three remaining points 
are equitorial. When z 4 corresponds to a polar point, we have the following 
configuration, which is unique up to the permutation of Zi, z 2 , z%: 

z = 1; z\ — exp(— 2m/3); z 2 = 0; z% = exp(27ri/3); 

(30) 

When Z4 corresponds to an equitorial point, we have 

z = 1; Z\ — -iv3/2; z 2 — — 1 z 3 = iy/3/2; (31) 

Later on, we will use a symmetry argument to avoid having to deal with the 
second of these configurations. 

The Space of Minimizers: We check 3 facts using Equation [6j 

1. If \z\ = 1 then IIE -1 ^) - S _1 (oo)|| = V2. 

2. If \z\ > 2 then \\^-\z) - E -1 (oo)|| < I. 

3. If \x\ > 4 then HS" 1 ^) - E _1 (oo)|| < 1/2. 

Suppose now that Z is a minimizer for some power law potential. It 
follows from Lemma 12.11 and Item 3 that \zj\ < 3 for j = 1,2,3. If follows 
from Item 1 and Lemma [4.21 that \zq\ > 1. We conclude that z G [1,4]. It 
follows Lemma \2. 21 and Item 2 that \zj\ < 2 for j = 1, 2, 3. We conclude that 
all the configurations we need to consider lie in the compact region compact 
region = [0,4] x [—2, 2] 6 C -R 7 . The map is given by just stringing out the 
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coordinates of the points z , z 1 , z 2 , z%. 



Dyadic Objects: Given squares Qi,Q 2 C C, we write Q\ — > Q 2 if Q 2 
is one of the 4 squares obtained by dividing Qi in half along both directions. 
We say that a square Q is a dyadic square if there is a finite 



The sides of Q are necessarily parallel to the coordinate axes, and the vertices 
have dyadic rational coordinates. We call the dyadic square normal if it does 
not cross the coordinate axes. A single subdivision of [—2, 2] produces normal 
dyadic squares. 

We make the same definition for line segments as for squares., except that 
the notation Si — > S 2 means that S 2 is one of the two segments obtained by 
cutting Si in half. We say that a dyadic segment is a line segment S such 
that there is a finite chain 



We say that a dyadic box in is a set of the form Q x Qi x Q 2 x Q 3 , where 
Qo is a dyadic segment and Qj is a dyadic square for j = 1,2,3. The whole 
space f2 itself is a dyadic box. 

Subdivision: Let Bj = Q j0 x Q jX x Q j2 x Q j3 be a dyadic box for j = 1, 2. 
We write B x ^r k B 2 if 

• Qu = Q21 for i ^ k. 

• Qlk — > Q2k- 

The kth subdivision of B\ is the union of all B 2 such that B\ — > k B 2 . When 
k = 0, this union consists of two dyadic boxes. When k = 1,2, 3, the union 
consists of 4 dyadic boxes. The set of all dyadic boxes in Q forms a directed 
tree. Each dyadic box points to 14 = 2 + 4 + 4 + 4 smaller dyadic boxes. 

The divide-and-conquer algorithm will be a depth-first search through 
this tree. It seems that the speed of the program depends a lot on how we 
do the subdivisions, so we will explain this in detail in the next section. 



[-2,2] 2 ^Qi^ ...^Q n = Q. 



(32) 



[0,4]^S 1 ^...^S n = S. 



(33) 
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2.5 The Divide and Conquer Algorithm 

Our discussion applies to a general power law potential, but we only apply 
the algorithm to the Coulomb potential. 

Let e > be some small number. In this section, we explain in the ab- 
stract how we show, with a finite calculation, that any winning configuration 
lies within e of one of the two TBP configurations described in §2.41 There 
are 5 components to our program: 

Confinement: We say that a dyadic box 

Q = Q x Q 1 x Q 2 x Q 3 . (34) 

is e-confined if, for each j = 0, 1, 2, 3, the dyadic square Bj is contained in 
the open square of side length e centered on the point Zj from Equation [301 

Tetrahedral Eliminator: We eliminate Q is if satisfies the hypotheses 
of Lemma 13.61 In this situation, every configuration in Q violates Equation 
[131 This function speeds up our calculations quite a bit. 

Redundancy Eliminator: We will isolate a subset Q' C Q with the fol- 
lowing property: If Z G Q — Q', then there is some Z' C Q' (obtained by 
permuting the points of applying some obviously energy-decreasing move) 
such that E(Z') < E(Z). In §4.41 we will describe some simple tests we use 
to show that Q G Q — Q'. We eliminate Q if it passes one of these tests. 

One convenient property of Q' is that it contains the configuration in 
Equation [201 but not the configuration in Equation [HU This means that we 
can automatically eliminate configurations near the one in Equation [3TJ This 
is why our notion of e-confinement only mentions the configuration in Equa- 
tion M 

Energy Estimator: Now we come to the main point. Say that an en- 
ergy estimator is a function $ : S — > R such with the following property. 
For every configuration Z = {z , z^} with Zj G Qj, we have 

E{Z) > $(Q). 

See Theorem 15.11 for the definition of our Energy Estimator. Theorem 15.11 is 
our main technical result. 
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Depth First Search: Recall that M e is the energy of the TBP. Our pro- 
gram maintains a list L of dyadic boxes. Initially, C just has the single box 
fl, the whole space. At a given stage of the program, the algorithm examines 
the last box Q and eliminates it if one of three things happens: 

1. Q is e-confmed. 

2. Lemma [3.61 eliminates Q. 

3. Onf the the tests in §4.41 eliminates Q. 

4. *(Q) > M e) . 

Otherwise, Q is eliminated and to C we append the dyadic boxes in the kth 
subdivision of Q for some k G {0,1,2,3}. We will explain how k is deter- 
mined momentarily. The algorithm halts if £ is the empty list. In this case, 
we have shown that, up to symmetries, any minimizer lies within e of the 
TBP. 

Subdivision Rule: The error term in Theorem 15. II has the form 

3 

i=0 jyti 

Here t(Qi,Qj) is a quantity that depends on the geometry of Qi and Qj. 
Roughly, it varies quadratically with the side length of Qi. We write 

e(i) = X>(^')- (36) 

Then, again, e(i) depends roughly quadratically on the side length of Qi. We 
find the index k that maximizes the function % — )■ Q{%) and then we use the 
fcth subdivision rule. Thus, we subdivide in such a way as to try to make the 
error estimate in Theorem 15.11 as small as possible. When we compare this 
method with a more straightforward method of subdividing so as to keep the 
squares all about the same size, our method leads to a vastly faster compu- 
tation. 

Remark: The main difficulty in our proof is choosing an energy estimator 
that leads to a feasible calculation. We found the subtle Energy Estimator 
from Theorem 15.11 after quite a bit of trial and error. 
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2.6 The Main Results 



Let 8 denote the energy function on Vt. For any s > let f2 s denote those 
configurations {zk} so that Zk is contained in a square of side length s centered 
at the kth point of the TBP, normalized as in Equation [301 We shall be 
interested in the cases when 

s = 2~ n . (37) 
Running our computer program, we prove the following result. 

Lemma 2.3 (Confinement) Any Coulomb energy minimizer in Q has the 
same energy as some configuration in f2 s /4. 

Let H e denote the Hessian of 8, with respect to the potential E{r) = r~ e . 
In §H]we will prove the following result. 

Lemma 2.4 For e = 1, 2 and for any W e Q s , the matrix H e (W) is positive 
definite. 

Now let Zi be a 5-point minimizer for the Coulomb potential. By the 
Confinement Lemma, we can find a new configuration Z2 G Vt s with the 
same energy. But 8 has positive definite Hessian throughout Q s , and (by 
symmetry) the gradiant V£ vanishes at Zq, the TBP. Restricting 8 to a 
straight line segment connecting Zq to Z2 we see that 8 is a convex function 
with a local minimum at Zq. Hence 8{Z-i) > 8(Zq). This proves Theorem ll.il 

Remarks: 

(i) Our program also establishes the Confinement Lemma for the function 
E(r) = r~ 2 . The same argument as above now establishes Theorem 11.21 

(ii) We didn't need to compute all the way down to fi s / 4 . We could have 
stopped at Q s in the Confinement Lemma. However, an earlier version of 
this paper had a weaker result on the Hessian, and we did the extra comput- 
ing to accomodate this. There doesn't seem to be any reason to throw out 
our stronger computaional result since we (or, rather, the computer) took 
the trouble to get it. 
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3 Separation Estimates 



3.1 Overview 

Let E be stereographic projection. The sets of interest to us have the form 

Q* = E-^Q). (38) 

where Q is either a dyadic segment or a dyadic square or the point oo. We 
will call Q a dyadic planar set and Q* a dyadic spherical patch. A dyadic 
spherical patch is either the point (0, 0, 1), or an arc of a great circle, or else 
a subset of the sphere bounded by 4 arcs of circles. 

Let (Qi,Q2) where Qj is a dyadic planar set. In this chapter we are 
interested in the following two quantities. 

Vw(<2i,<22) = max||pi -p 2 ||; Pj e Q*. (39) 

VVin(<2i,<22) = min ||px -p 2 ||; p 3 - G Hull(Q*). (40) 

Here Hull(Q*) is the convex hull of Q*. For technical reasons the lower bound 
we seek need to work for a slightly larger range of points. 

In this chapter, we will estimate i/)(Qi, Q2) in terms of quantities that can 
be determined by a finite computation. At the end of the chapter, we will 
give, as an application of these estimates, the definition of the Tetrahedral 
Eliminator discussed in §2.51 

In the next definitions, we set z = x + iy. If Q is a dyadic square or 
dyadic segment, we define 

a: = min|x|; x = max|x|; y = mm\y\; y — maxM; 

5 = ^~f y y r = ^MQ). (41) 

These quantities depend on Q but we usually suppress them from our nota- 
tion. Here x — x is just the sidelength of Q. The quantity 5 is an estimate on 
the side length of Q*. Note that r = 1 if Q is a dyadic segment and r = v^2 
if Q is a dyadic square. 

When Q — {oo}weset5 = r = and we don't define the other quantities 
know Q* exactly in this case. 
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Given a pair (Q 1 , Q 2 ) of dyadic objects, we 

DHIs-'M-s-'fe)!!; s = s -^±^. (42) 

Here zj is the center of Qj and 5j = 6(Qj), as defined above. Here is our 
main result. 

Lemma 3.1 (Bound) The following is true relative to any pair of dyadic 
patches. 

> D(l - 5 2 /2) - (VA^)5; </w < D + (Va^)5. 

Remark: Our proof will yield the better bounds 

> D cos(5) - (V4-D 2 ) sin(5) < D cos(5) + (VI - D 2 ) sin(<5) , 

but for computational reasons we want to avoid the trig functions. We have 
used rational replacements which are quite close in practice to the trig func- 
tions. 

There is one situation where we can get a completely sharp upper bound. 
We define 

^maxWi^z) = max \\px -p 2 ||; pj e Q*. (43) 

This time we mazimize over pairs (pi,P2), where pj is a vertex of Q*. A 
finite computation gives i/;^. We define ip' min similarly. Recall that a dyadic 
square is normal if it doesn't cross the coordinate axes. 

Lemma 3.2 (Perfect Bound) // Qi is normal and Q2 = {00} then we 
have V>max = ^max and Vw = 4>' m m- 

We define \l/ m in to be the best bound we can get from the two lemmas 
above (or 0, if no lemma applies.) We define ^/ max to be the best bound we 
can get from the two lemmas above (or 2, if no lemma applies.) 

The rest of the chapter is devoted to proving Lemma 13.11 and Lemma 13.21 
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3.2 Proof of Lemma I3TT1 

Lemma 3.3 Let A and B be arcs of the unit circle. Let Aa and A B denote 
the arc lengths of A and B Let Da and Db denote the distance between the 
endpoinds of A and B respectively. Let 

= Aa-A b 



Then 



D B = D A cos(6) ± (^4 - L>|) sm(8). 



Proof: We have the relations 

D A = 2 sin(A A /2); D B = 2 sin(A B /2) = 2 sin(A j4 /2 - 5). (44) 
Hence, by the angle addition formula, 

D B = D A cos(5) - 2 cos(sin- 1 (£) A /2)) sin(<J). (45) 

Noting that 

cos(sin _1 (a;)) = Vl — x 2 , (46) 
we see that Equation H5] is the same as the result we want. 4jk 



Lemma 3.4 Let Q be a dyadic set. Every point of Q* is within 5t/2 units 
ofX-\z Q ). 

Proof: When evaluated at all points of Q, the quantity in Equation [7] is 
maximized at the point (x,y). Its value at that point is exacrly 



1 + x 2 + y 1 



Therefore S 1 expands distances on Q by at most a factor of 5/s, and every 
point of Q is within sr/2 of z. From here the result is obvious. 4 

Now we give the main argument for Lemma 13.11 We consider the lower 
bound first. Given a point p G S 2 and some r > we let H(p, e) C S 2 denote 
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the convex hull of the set of points on S 2 that are within e of p in terms of 
arc length on S 2 . We call such a set a cap. 

It follows from Lemma 13.41 and the convexity of caps that 

Hull(Q*) C Hj = H{ Pj , SjTj/2). (47) 

Here pj = ^^(zj), where Zj is the center of Qj. 

Suppose first that H 1 and H 2 are disjoint. Let (qi, q 2 ) G Hi x H 2 be two 
points which realize the minimum of \\qi — q 2 \\. We must have qj G dHj. 
Also, the segment joining q 1 to q 2 must be perpendicular to both dHi and 
dH 2 . This situation leads to the result that q± and q 2 are contained on the 
great circle C joining p\ and p 2 . 

We can now reduce everything to a problem in the plane. Let IT be the 
plane containing C. We identify II with C, so that C is the unit circle. Figure 
3.1 shows the situation. We have drawn the case when both intersections lie 
in a half-disk, but this feature is not a necessary part of the proof. 




Figure 3.1: Two circular caps. 



Let A be the short circular arc joining p\ and p 2} and let B be the short 
circular arc joining q\ and q 2 . Referring to Lemma [3 .3[ we have 

D A = D(Q 1 ,Q 2 ) = \\ Pl -p 2 \\; (48) 

Here D = D(Qi, Q 2 ) is the quantity in the statement of the lemma. We also 
have 

VVin < D B = \\qt- q 2 \\. (49) 
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Finally, the length of the circular arc joining pj to qj is 5jTj/2. Therefore 

5(A, B) = A A -A B = W2 + W2 = S(Q U Q 2 ). (50) 

Applying Lemma I3.3[ and using our three equations, we get 

^min > D cos(5) - Ufi - L> 2 ) sin(5) (51) 

But cos(5) > 1 — 5 2 and sin(<5) < 5. This gives us the lower bound from 
Lemma [3.11 in the case the caps are disjoint. 

When the caps intersect, we have -?/>min = 0. We just want to show that 
the lower bound in Lemma 13.11 is nonpositive. We want to reduce this case 
to the previous one. Choose some small e > and consider smaller caps H[ 
and H' 2 that are separated by a distance of exactly e. These two caps are 
based on some number 5' < 5. Our argument in the previous case gives 

e > D(l - (5') 2 /2) - (VA-D^d' (52) 

D(l - (6') 2 /2) - (yA^D^b' > D(l - 5 2 /2) - (Vl^D^. (53) 

Combining these two results, we see that the lower bound in Lemma 13.11 is 
less than e. But e is arbitrary. Hence, the lower bound in Lemma 13.11 is 
nonpositive in this case. This completes the proof of the lower bound. 

The proof of the upper bound is similar. Suppose first that Hi n H 2 does 
not contain a pair of antipodal points. Then, by Lagrange Multipliers, the 
pair of points (91,52) realizing the maximum lie on the great circle through 
Pi and q 2 . We then rotate as above and apply the same argument. This time 
we get 

< D cos(5) + (VI - D 2 ^ sin(5) (54) 

But cos(5) < 1 and sin (5) < 5. This gives us the lower bound from Lemma 
13.11 in the case Hi U H 2 does not contain a pair of antipodal points. 

If Hi U H 2 contains a pair of antipodal points then one of two things is 
true. If pi and p 2 are antipodal points, then obviously the upper bound in 
Lemma l37fl gives a number greater than 2. If pi and p 2 are not antipodal we 
can use the same shrinking trick that we used in the previous case to show 
that the upper bound in Lemma 13.11 gives a number that is at least 2. In 
either case, the upper bound still holds. 

This completes the proof of Lemma 13 . 1 1 



20 



3.3 Proof of Lemma [3721 



Lemma 3.5 Suppose that Q\ is a dyadic segment or square and Q2 = {00}. 
Let p 2 = (0,0, 1). Then, for point p 1 G Q\ we have ijj' miD < \\pi — P2II < V'max- 
In particular, the upper bound in Lemma \3.S\ is true. 

Proof: Consider the lower bound first. The point Z\ G Q\ which minimizes 

IIE-^zO-E-^oo)!! (55) 

is the point furthest from the origin. But, since disks are convex, the point 
of Q\ farthest from the origin is a vertex. 

Now for the upper bound. The point of Q± that maximizes Equation [551 is 
the one closest to the origin. If a vertex of Qi does not minimize the distance 
to the origin, then (by Lagrange multipliers) some ray through the origin in- 
tersects a side of Qi at a right angle. Since the sides of Qi are parallel to 
the coordinate axes, this can only happen if Qi crosses one of the coordinate 
axes. But Qi does not cross the coordinate axes. 4* 

Lemma 13.51 immediately gives the upper bound. For the lower bound 
(which makes a different statement) we need to deal with all the points in the 
convex hull Hi = hull(QJ). Any point q G H\ that minimizes \\q — (0,0, 1)|| 
must lie on dH\. We claim that dHi is the union of the following 

• QI- 

• The flat quadrilateral F that is the convex hull of the vertices of Q\. 

• The convex hulls H(Ej) of the edges Ei, E4 of Q\. 

Each of these sets is a subset of Hi and we easily check, in each case, that 
every point in each set is a boundary point. Since the union of these sets is 
a topological sphere, it must account for the entire boundary. 

Our lemma above takes care of the case when q G Q*. If q is an interior 
point of F, then the segment joining (0, 0, 1) to q is perpendicular to F. But 
F is completely contained in a hemisphere that has (0, 0, 1) on its boundary, 
so no such segment can exist. 

Finally, suppose that q G H(Ej). Note that H(Ej) is contained in a plane 
IT which also contains (0, 0, 1). But, considering the distance minimizimation 
problem in IT, we see that q cannot be an interior point of H(Ej). (The 
picture looks just like the one drawn in Figure 3.1.) But then either q G QI 
or q G F, the two cases we have already handled. This completes the proof. 
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3.4 The Tetrahedral Eliminator 



Let Q be a dyadic box, as in the previous section. As in Equation [131 the 
quantity Me is the energy of the TBP and the quantity Te is the energy 
of the regular tetrahedron. Let \l/ m ax be the upper bound on the distance 
between Q\ and Q\ as in §3.11 

Lemma 3.6 (Tetrahedral Eliminator) Let Q be a dyadic box. Suppose 
that there is some index i such that 

E(d tJ ) > Me — T E ; d tj = ^ max (Qi, Qj). (56) 

Then no configuration in Q is a minimizer for the E-energy. 
Proof: By construction \\pi~ Pj\\ < dij for all pi G Q* and pj G Q*. Therefore 

E(P, Pi ) > J2 E ( d ^ >M E -T E (57) 

for any configuration P corresponding to a point in Q. But then every such 
configuration violates Equation [T3] and cannot be a minimizer. 4k 

3.5 Discussion 

We can get a cheap Energy Estimator using the fact that 

8{Z) > ^£(e%); d l3 = V max (QhQj)- (58) 

i<j 

However, experiments lead us to believe that the main calculation would 
effectively take forever using this estimator. 

Here is the problem: Even though E(dij) gives a good estimate for the 
minimum energy of a pair of points (pi,Pj) G Q* x Q*, there is no guarantee 
that we can find a single configuration {pj} such that each dy is nearly 
realized by -E(||pi — Pj\\). So, the minimum energy of a configuration we 
can actually produce might be much higher than our estimate. That is, our 
estimate might not be that good globally (for all 10 interactions) even though 
it is good locally (for pairs of interactions.) We need to work harder to get a 
globally good energy estimator. This is the content of Theorem 15.11 
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4 The Redundancy Eliminator 



The constructions in this chapter work for any power law, and most of the 
constructions (just symmetry) work for any decreasing energy function. 

4.1 Inversion 

An inversion is a conformal involution p of C U oo that fixes some circle C 
pointwise. We call p stereo-isometric if S _1 (C) is a great circle of S 2 . This 
condition means that stereographic projection conjugates p to an isometry of 
S 2 . The purpose of this section is to prove the following lemma. The reader 
might want to just note the result and skip the proof on the first reading. 

Lemma 4.1 Let p be a stereo-isometric inversion fixing a circle centered on 
a point of [l,oo) C R. Let X C C be a strip bounded by horizontal lines, 
one lying above R and one lying below R. Let X_ C X denote those points 
z = x + iy with x < 1 — sfl. Then p(X_) C interior (X) — X_. 

Proof: Being an inversion, the map p has 3 nice geometric properties. 

• p interchanges the disk D bounded by C with its complement. 

• p interchanges the center of C with oo. 

• p preserves each ray through the center of C . 

We will use these properties in our proof. 

Let zq be the center of C. We claim that C separates zq from any point 
on R that lies to the left of 1 — Let po = X" 1 ^). As usual, (0, 0, 1) = 
S _1 (oo). Let II be the plane equidistant between p and (0,0, 1). Then 



Let H be the half-space bounded by II that contains p . 

The point p lies on the great circle connecting S _1 (l) = (1,0,0) to 
(0,0,1). Since Zq G [1,oo), the point po lies on the short arc connecting 
(1,0,0) to (0,0, 1). But then 



c = ^(uns 2 ) 



(59) 




^ interior (H). 



(60) 
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The extreme cases occurs when p = (1,0,0). In this extreme case q e IT PIS' 2 . 
As p moves towards (0, 0, 1), H fl S 2 moves away from q. Since H does not 
contain (0, 0, 1) either, the entire arc connecting q to (0, 0, 1) is disjoint from 
the interior of H. But this means that C separates zq from all points of R 
that lie to the left of 1 — \/2. This proves our claim. 

Let D be the disk bounded by C that contains zq. From what we have 
already shown, the interior of D is disjoint from the vertical line L through 
1 - V2. Therefore D n X_ = 0. Since p swaps D and its complement, we 
have 

p{X.) C D. (61) 

Hence 

p(X_) n X_ = 0. (62) 

At the same time, p preserves all rays through the Zq, the center of D. 
Noting again that zq e R, we see geometrically that p(w) is closer to R than 
w for all w G X — D. See Figure 4.1. In particular, we have 

p{X_) C interior (X). (63) 

The first statement of the lemma now follows from Equations [62] and [63j The 
second statement follows from the first statement and from the fact that p 
is an involution. 4 





L 


/D 
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Figure 4.1: Inversion 



24 



4.2 A Lemma about Five Points 



The result in this section is certainly well known. We suggest that perhaps 
the reader just note the result and skip the proof on the first reading. 

Lemma 4.2 Let P be any configuration of 5 distinct points on S 2 . Then 
some pair points of P are within \/2 units of each other. 

Proof: A quarter sphere B is a region on S 2 bounded by two semicircular 
great arcs, meeting at antipodal points, such that the interior angles at the 
intersection points are ir/2. The axis of B is the circular arc, contained in 
B, that connects the midpoints of the two bounding arcs. We will use the 
following easy-to-prove principle: If p is any point on the axis of B, then the 
closed hemisphere centered at p contains B. Call this Property X. 

The lemma has a trivial proof if P contains two antipodal points, so we 
assume this is not the case. Let Hj be the closed hemisphere centered at pj 
of P. Another way to state this lemma is that there are indices % ^ j such 
that pi G Hj. Assume this is false, for the sake of contradiction. 

Let N denote the hemisphere centered at (0,0, 1), and let S denote the 
opposite hemisphere. We normalize so that p = (0,0,1). By assumption, 
Pi, ...,P4 G S — N. Since P does not contain antipodal points, P does not 
contain (0,0, —1). But then, for j = 1,2,3,4, there is a unique point p* on 
the equator N n S which is is as close as possible to pj. 

Let H* denote the hemisphere centered centered at p*. By construction, 
H* D S is a quarter sphere and pj is a point on its axis. By Property X, 
we have H* D S C Hj. But then, by assumption, p { £ H* Hi ^ j. Let 
7r : S — > R 2 be the projection map n(x,y,z) = (x,y). Then n(S) is the unit 
disk and n(S fl H*) is a half disk. Finally, n(pi) and n(p*) lie on the same 
ray through the origin. For this reason, p { G H* if and only if p* G H*. Since 
Pi $l H*, we conclude that p* G" H*. 

Now we know that p* G" H* Hi ^ j. Hence, the distance from p* to p* 
is greater than n/2. But then we have 4 points p*, ...,pl, contained on the 
equator, each of which is more than 7r/2 from any of the other points. This 
is a contradiction. 4 
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4.3 The Set of Good Configurations 

Recall that Q is our configuration space. Our construction here works for 
any energy function E. In this section we define the set Q' C Q, as discussed 
in §2.51 in connection with the Redundancy Eliminator. 

Let Z = {z , Z4} be a configuration of points in C U 00, normalized as 
in §2.41 We write Zj = Xi + Hi- Also, we set pj = E -1 ^-). Let Q' be the set 
of configurations Z having all the following properties. 

1- \\Pi ~ Pj\\ < \\Po — Pi\\ f° r a ll indices i 7^ j. Also x > 1. 

2. yi < < y 2 < y 3 . 

3. If yt < < 2/3 then x 2 > I - V2. 

Lemma 4.3 For any Zefi, there exists Z' 6 f2' s«c/i t/iat E(Z') < E(Z). 

Proof: Permuting the points, we can the first half of Property 1. By Lemma 
14.21 and the first half of Property 1, we must have ||po>2?4|| < v2- But this 
fails if zq G [0, 1). Hence xq > 1. In short, the first half of Property 1 implies 
the second half. 

Reflecting in R, we can arrange that y 2 > 0. Permuting again, we can 
retain Property 1 and arrange that y\ < y 2 < 2/3. To get Property 2, we just 
have to deal with the situation when < y±. Let Z' be the new configuration 
obtained when we replace zi with the conjugate zi and otherwise keep the 
points the same. Let P and P' be the corresponding configurations in S 2 . 
The points p , ...,p 5 are all contained in the hemisphere bounded by the great 
circle 

C = £ -1 (i2Uoo). (64) 

The configuration P' is obtained from P simply by reflecting p± across C. 
But, as can be seen from e.g. the Pythagorean theorem, we then have \\p[ — 
Pj\\ > ||pi — Pill for j /3. The other distances do not change. Since E 
is a monotone decreasing function, we have E(Z') < E(Z). This gives us 
Property 2. 

It remains to deal with Property 3. Suppose that y\ < < y 3 and 
x 2 < 1 — V2. Let p be the stereo-isometric inversion that swaps z and z 4 . 
Let Z' = p(Z). Let z'- = p(zj). We set z'- = x'j +iy'j- By construction Z' has 
property 1. By symmetry, 

y[ < < y> 3 ; y' 2 > 0. (65) 
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There are two cases to consider. 



Case 1: Suppose first that 

2/2 < 2/3 (66) 
Equations [65] and [66] combine to say that Z 1 has Property 2 as well. Let 
X be the strip bounded by the horizontal lines through z\ and z^. Then X 
satisfies the hypotheses of Lemma 14.11 Moreover, z 2 G X_, the half-strip 
from Lemma 14.11 Then z' 2 E X — X_ by Lemma 14.11 Hence x' 2 > 1 — y/2. 
This shows that Z' has Property 3. Hence Z' G fi'. 

Case 2: Suppose that y' 2 > y 3 . Then we let Z" be the configuration ob- 
tained from Z' by swapping z' 2 and z' 3 . Then satisfies Properties 1 and 
2. The argument in Case 1 again shows that z 2 G X — X_. In particular, 
^2 £ X. By construction, 

z 3 ^ interior (X). (67) 



By Lemma [4. 1[ we have 

p{XS) C interior(X). (68) 
Combining these last two equations, we see that 

Zz<£p{X). (69) 
Since p is an involution, this last equation gives us 

4£X_. (70) 
Summarizing the situation, we now know that 

z' 2 ex- z' 3 <£X_. (71) 
Since < y' 3 < y 2 and z 2 e X, we have 

4 G ^- (72) 

Combining the last two equations, we have 

4 G X - X'; 4' = 4 > 1 - v 7 ^. (73) 

The first statement implies the second. The second statement shows that Z" 
has Property 3 as well. Finally, E(Z") = E(Z). 4jk 
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4.4 The Main Construction 



Now we define the Redundancy Eliminator discussed in §2.51 The redun- 
dancy eliminator performs 4 tests, one per property discussed above. Let 
Q = Qo x Qi x Q2 x Q3 be a dyadic box. As usual, a configuration Z e Q 
defines points p , ...,p 4 , with pj e Q^. 

Property 1: Let ^/ max and \l/ m in be the separation functions from §3.11 
We eliminate Q if 

^m ax (Qi, Qj) < ^min(<2o, Qi) (74) 

for some pair of indices i < j such that 7^ (0,4). We also eliminate Q 
if x < 1. 

Property 2: Let a^, etc be as in §3.11 We eliminate Q if any of the following 
is true. 

• V x > U21 

• y 2 > Us- 

• y 2 < 0. 

• y x > 0. 

In the first 3 cases, the fact that we are using a weak inequality rather than 
a strict inequality means that sometimes we eliminate some configurations 
that lie in dVL' . Since we want to consider every configuration in Q', we need 
to justify this. We give the justification in the next section. 

Property 3: We eliminate Q if all of the following happen. 

• Vx < 0. 

• y 3 >0. 

• x 2 < 1 - V2. 

This time there are no boundary cases to worry about. 
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4.5 Discussion of Boundary Cases 

Here we discuss the use of weak inequalities in connection with Property 2 in 
the previous section. First of all, the reason why we want to do this is that it 
speeds up the computation. For example, were we to keep dyadic boxes with 
y 2 = we would need to consider many more dyadic boxes that are near the 
TBP. 

The justification for why we can use weak inequalities is that all the 
configurations in dfl' that we eliminate are actually counted twice, and we 
do not eliminate the relevant dyadic boxes both times. 

To clarify the situation, we make the interpretation that our dyadic 
squares Q 2 and Q 3 are missing their top boundaries and Qi is missing its 
bottom boundary. With this convention, the divide and conquer algorithm 
from §2.51 examines every point in the configuration space except those for 
which \yj\ = 2 for some j. These configurations are not minimizers. See the 
discussion in §2.41 The main point here is that the union of "quarter-open" 
dyadic squares in the subdivision of a "quarter open" dyadic square is still 
equal to the original "quarter open" dyadic square. In the case of Q 2 and 
Qs, the bottom edges fill in for the top edges. In the case of Qi the top edges 
fill in for the bottom ones. 

With this interpretation, we can simply eliminate the dyadic box Q men- 
tioned above, because it contains no configurations in Q'. Adopting this 
convention has no effect whatsoever on our program. It is simply a question 
of how we interpret the output of the program. 
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5 The Energy Estimator 



5.1 Preliminaries 

We fix some value of n. Say that a block Q is a collection Qo,...,Q n of 
dyadic objects, with Q n = {00} and all the other objects either segments or 
squares. Say that a collection of points z ,...,z n of points is dominated by 
the block if 2% G Qk for all k. In our application, we will take n — 4. In this 
case, the set of configurations dominated by a block is precisely a dyadic box. 

Remark: Though we always take Qk to be a dyadic segment or square 
for k = 0,...,n — 1, the interested reader will note that everything we do 
works the same way with the milder constraint that each Qk is either a seg- 
ment in [0, 00) or a square with sides parallel to the coordinate axes that 
does not cross the coordinate axes. 

We say that {zk} is a vertex configuration if Zk is a vertex of Qk for 
k = 0, ...,n. There is a finite list of vertex configurations. Given an energy 
function E, we define 

S(Q) = mm E(Z). £'(Q) = mmE(Z'). (75) 
z z 

The first minimum is taken over all configurations Z dominated by Q and 
the second minimum is taken over all vertex configurations Z' . The purpose 
of our main result in this chapter is to bound the quantity 

ERR(Q) = £'{Q) - S(Q) (76) 

in terms of Q. 

We are interested mainly in the case when the function E is a power law, 
but we will state things more generally just to clarify the formulas we get. 
We suppose that E is convex decreasing and satisfies the following property. 
If < r 2 < r\ then there are constants c > 1 and h > (depending on 
everything) such that 

= . E"{r 2 ) = {c + h)E"{r 1 ). (77) 

r 2 r 1 

This condition could be stated more simply, but we have stated exactly the 
version we will use in Lemma T6.4[ the one place where we use it. Other than 
this one place, the rest of our argument works for any convex decreasing 
function. The power law functions all satisfy these conditions with h = 0. 
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5.2 The Main Result 

We defined \l/ m i n in §3.11 Given two dyadic objects Q and Q, we define 

# = *min(Q,Q). (78) 

When R = 0, our bound gives oo, a result that holds no matter what. So, 
without loss of generality, we treat the case when R > 0. In this case, the 
two sets Q* and Q* are contained in disjoint convex sets. 
We define 

e(g,Q) =max(0, Ax) + A 2 . (79) 
When Q is a dyadic segment, 

A -l B 'w+G-S)^ < 8 °> 

When Q is a dyadic square, 

Ai = X E '(R) + (±-^-)E"(R); A 2 = -^fv / T+^+v / T+F 
16 V ; V4 16/ V ; 7.98 \ v y 

(81) 

Let Si be as in EquationUU relative to Q 4 . Recall that <5j is a good estimate 
for the side length of of the spherical patch Q*. Also, let = e(Qi, Qj). 

Theorem 5.1 (Energy Estimator) Let Q = (Qo,---,Q n ) be any block. 
Then 

n-l 

Again, we remark that the case n = 4 is the case of interest to us. For 
reference, we work out the power-law case E(r) = r~ e explicitly. When Q is 
a dyadic segment, 

A -±±A e ( e + 2 ) . a _ e ( z 2) 

Al " -JR^ 32^' A2 " 8iF^ (82) 

When Q is a dyadic square, 

A -^(^+v/TTF) (83, 
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5.3 The Beginning of the Proof 

Recall that a partition of unity is a collection of functions that sum to 1 at 
every point. A point weighting assigns a partition of unity 

A a6 :Q->[0,l]; a, b E {0,1} (84) 

to each dyadic square Q and a partition of unity 

A a :Q-»[0,l]; a G {0, 1} (85) 

to each dyadic segment. We have a specific point weighting in mind, and we 
will define it in £JB1 

We let the vertices of a dyadic square Q be Q ab for a, b G {0, 1}. Here Q o 
is the lower left vertex and the remaining vertices, traced in counterclockwise 
order, are Qiq, Qn, Qq±. We let the vertices of a dyadic segment Q be Qo 
and Qi, with Qo on the left. 

Given two points z, w G C U oo, we define 

/( ^ ) = ii S -^)-V>)ir (86) 

Let X denote the set of disjoint pairs (Q,Q), as in the previous section. 
Let e : X — )■ R be as in the previous section. We will spend the next 3 
chapters proving the following result. 

Lemma 5.2 There exists a point weighting such that the following is true 
for all (Q,Q) G X. 

• When Q is a segment 

(j2K(z)f(Q a ,w))-f(z,w) < e(Q, Q) 5{Q) 2 - W(z,w) eQxQ. 

a 

• When Q is a square 

( £ X ab (z)f(Q ab , w)) - f(z, w) < e(Q, Q) 6(Q) 2 ; W(z, w)eQxQ. 

a,b 
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Proof of Theorem I5.lt For ease of exposition, we will assume that the 
dyadic objects Q , ...,Q n _i are all dyadic squares. The case when there are 
some segments involved presents only notational complications. Consider 
a configuration z Q ,...,z n , domainated by Q, that realizes S(Q). For each 
2 = 0, n, define 

Bi = {z } x {Zi-i} xQiX ... x Q n . (87) 

Just to be explicit, the two extreme cases are 

B = Q x ... x Q n ] B n = {z } x ... x {z n }. (88) 

(Recall that Q n = {z n }.) The set Bi is a (2n — 2z)-dimensional rectangular 
solid. Let £\{i) denote the minimum energy taken over all configurations of 
Bi. Let £2(2) denote the minimum energy over all vertex configurations of 
Bi. The conclusion of this lemma is exactly 

n-1 

£ 2 (0) - £i(0) < E e iAQiT (89) 

We have £ 2 (n) = £i(n), because B n is a single point. Hence, we have the 
telescoping sum 

n-1 

£ 2 (o) - £1(0) = - £1 w) - (s 2 (i + i)-e 1 (i + 1)) (90) 

i=0 

Our choice of {zi} as a minimizing configuration implies that £\{i) is 
independent of i Combining this with Equation [901 we get 

n-1 

£ 2 (0) - £i(0) = Y,e 2 (i) - £2(1 + 1). (91) 

i=0 

To establish Equation [891 h suffices to prove 

S 2 (i) -S 2 (i + l) <^%^; Vi = 0, ...,n - 1. (92) 

For the remainder of the proof, we fix some i G {0, n— 1} once and for 
all. We can find vertices fj+i, ...,v n of Qi+i, ■••> Qn respectively such that 

E(z 1 , Zi, v i+1 , ...,v n ) = £ 2 (i + 1). (93) 
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Here E(...) is the energy of the configuration. Note that Q n is a singleton, 
so that our "choice" of v n is forced. 

For any choice of a, b G {0, 1}, let Qi ab be the corresponding vertex of Qi. 

Let 

Kb = Kb(zi) (94) 

be as guaranteed by Lemma 15.21 The function X ab depends on i, but we 
suppress this from our notation. 

Since £2(2) is the minimum energy of any vertex configuration of Bi, 

E(zi, Qiab, v i+ i, v n ) > £2(1). (95) 

Hence, 

£ 2 (i) - £ 2 (i + 1) < Kb] 
A ab = E(z 1 ,...,z i -i,Qiab,v i+1 ,...,v n ) - E(zi, ...,Zi,v i+ i, ...,v n ). (96) 
Setting Wj = Zj for j < i and Wj = Vj for j > i, we have 

£ 2 (i) - £ 2 (i + 1) < 

KbA a b =1 

^fiQiabWj) ~ f(Zi,Wj 
a,b jj^i 

^2^2 Kb (Qiab, Wj) - ffaWj) 
j^i a,b 

Kbf{Qiab,Wj)^ - f(Zi,Wj) < 



(97) 

The first inequality comes from Equation [96] and from the fact that 
^2 X a b = 1- Equality 1 comes from the cancellation of all terms not in- 
volving the ith index when we subtract the two sums for A ab . Equality 2 
comes from switching the order of summation. Equality 3 comes from the 
fact that Ylab ^ab — 1 • The last inequality follows from Lemma 15.21 This 
establishes Equation [521 which is all we need to prove Theorem 15.11 6 
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6 An Estimate for Line Segments 



6.1 The Main Estimate 

In this chapter we prove an estimate that relates directly to the Ai term in 
our Energy Estimator. The reader might want to simply note the main result 
in this chapter on the first reading, and then come back to the proof later 
on. Let E be as in the previous chapter. 

Let p G S 2 be some point. In terms of Lemma T5.21 we think of p as being 
some point in the spherical patch (Q)*. Let A' C -R 3 — {p} be a segment 
whose endpoints lie in S 2 . In terms of Lemma [5.21 we think of A' as joining 
two boundary points of the spherical patch Q*. We define 

F(q) = E(\\p-q\\) (98) 

Here F depends on p, but we suppress this from our notation. 
Define 

A' x = (1- x)A' + xA[; xe[0,l]. (99) 
Then x — > A' x is a constant speed parameterization of A'. 

Lemma 6.1 (Segment Estimate) Suppose that p is at least R units from 
every point of A' . Let S be the length of A' . 

(l-x)F(A' ) + xF(A' 1 )-F(A' x )<^, 

where 

X = ^E>(R) + (l-^-)E"(R). 

Remark: We wish to point out one unfortunate feature of our notation. 
The quantities E' and E" are derivatives of E whereas the quantity A' is 
simply a chord of S 2 . We make this notation because, in the next chapter, 
we will consider an arc A of S 2 and the chord A' that joins the endpoints of 
A. In some sense, the chord A' is a linear approximation to the arc A and 
the derivative E' is a linear approximation to the function E. 
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6.2 Strategy of the Proof 



Lemma 16.11 really just involves the single variable function <f)(x) = F(A' X ) 
defined for x £ [0, 1]. In §6.31 we prove the following easy estimate. 



Lemma 6.2 



(1 - x)<f>(0) + x(j>{\) - <f>(x) <—; H — sup (j)"{x). 

° xe[o,i] 



Here 4>"(x) is the second derivative with respect to x. 

Recall that S is the length of S. Let s denote the arc-length parameter 
along the segment S. We set things up so that s = corresponds to So and 
s = 5 corresponds to Si. In general, the parameter s £ S corresponds to 
x = s/5 £ [0,1]. In §6.51 we establish the following result. 

Lemma 6.3 Suppose we have the hypotheses in Lemma \6. 11 Then, 

By the Chain Rule, we have 

0"(x)=5 2 ^. (100) 

at corresponding points. 

Lemma 16.11 follows from Lemma I6.2[ Lemma I6.3[ and Equation 11001 



6.3 Proof of Lemma 16.21 

We will prove Lemma [6.21 for a smooth function h : [0, 1] — > R. We want to 
show that 

TT 

(1 - x)h{$) + xh{l) - h(x) < — ; H — sup h"(x). (101) 

8 xe[o,i] 

Let c be any nonzero constant and let L be any linear function. Equation 
I1U1I holds for the function h if and only if it holds for ch. Likewise, Equation 
11011 holds for h — L if and only if Equation 1X01 1 holds for h. Using these two 
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symmetries, it suffices to prove Equation 110 II in the case when h(0) = h(l) = 
and H = 1. In this case, Equation 110 II simplifies to 

-h(x)<± (102) 

Let a G [0, 1] be a point where —h attains its maximum. That is, h attains 
its minimum at a. Replacing h by the function x — > h(l —x) if necessary, we 
can suppose without loss of generality a > 1/2. 

We have h'(a) = and, by the Fundamental Theorem of Calculus, 

h'(x) = ! h"(t)dt <x-a (103) 

J a 

for all x G [a, 1]. But then 

r 1 C 1 (I -a) 2 1 

- h(a) = h(l) - h(a) = / h'(x)dx < / x - a dx = — < - (104) 



This completes the proof. 4 



6.4 Comparison Lemmas 

Let p and / be as above. Say that a flag is a pair (q, L) where L is a line 
and q G L is a point, and q^ p. We define the following quantities. 

• r(q, L) is the distance from p to q. 

• d(q, L) is the distance from p to L. 

• 9(q, L) is the small angle between pq and L. 

• D(q,L) = F"(q), the second derivative w.r.t. arc length on L. 

We want to compare flats (q\, Li) and (52, L 2 ). We set #1 = 9(qx, Li), etc. 
Our main result is Corollary 16. 6[ proved at the end. This result will help us 
establish Lemma 1631 

For each j we can rotate so that everything takes place in C, and pj = %Tj 
and Lj = R, and qj = Xj. Here xj + d 2 = r|. Suppressing the index, we have 

D = ^-E(y/tf + x^) =E"(r)^- + ^x^-. (105) 

dx z x t 1 r r z 

Here E' and E" are derivatives taken with respect to r. 
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Lemma 6.4 If 6 2 = 61 and r 2 < r\ and D\ > then D 2 > Di. 

Proof: In our situation, there is some constant p G (0, 1] so that 

— = — = — = p. (106) 
ri x 2 d\ 

It follows from Equation [77] that there are constants c > 1 and h > such 
that 

= c ^i) £ // (r2) = (c + / i ) £ "( ri ). (107) 

r 2 n 

It now follows from Equation 11051 that 

D 2 - cD x = hE"(n) > 0. (108) 
Hence D 2 > cZ?i for some c > 1. The lemma follows immediately. 4* 



Lemma 6.5 If 6 2 < Q\ and r 2 = r\ then D 2 > D±. 

The quantity d(q,L) is monotone increasing with 6(q,L). Thus, we have 
d 2 <d\. This time, we have 

r ■= r 2 = rx\ d 2 < d X ] x 2 > x x . (109) 

We again have Equation 11051 Note that E"(r) > and E'(r) < 0. When 
we change from the index j = 1 to the index j = 2, we do not decrease 
the positive coefficient of E"(r) in the first term and we do not increase the 
positive coefficient of E'(r)/r in the second term. Hence, D 2 > D\. 4jk 

The previous two results combine to prove the following corollary. 
Corollary 6.6 If 9 2 < 0\ and r 2 < r x and D\ > 0, then D 2 > D\. 

6.5 Proof of Lemma 16.31 

We continue using the notation from above. There is a unique plane n such 
that p U L C n. We rotate the picture so that n is the xy-plane. Let 
C = S 2 fl n. Then, as shown in Figure 6.1, 
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1. C is a circle whose radius is at most 1; 

2. Every point of S is at least R units from p. 



c 




Figure 6.1: The chord and the circle 



We are interested in bounding the quantity D(q,L), where L is the line 
containing the chord S. The chord S is subject to the two constraints men- 
tioned above. If C has radius less than 1, we replace C by a unit radius 
circle C, and S by a larger segment S' such that the pair (C, S') satisfies 
the same constraints, and the flag (q, L') is the same as the flag (q, L). Figure 
2.2 shows the construction. 
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Figure 6.2: Expanding the circle 



So, without loss of generality, we can assume that C is the unit circle. 
Also, it suffices to consider the case when D(q,L) > 0. Let q\ = q. Let r\ 
be the distance from p to q±. Note that r± > R. Let q2 denote a point on C 
that is exactly R = r<i units from p. Let L2 be the line tangent to C at q2- 
We want to apply Corollary 16.61 to the flats (qi, Li) and (q 2 , L 2 ). We already 
know that r 2 < r x . 

Lemma 6.7 d% < Q\. 

Proof: See Figure 6.3. Let q% be the endpoint of 5* such that the small 
angle 81 subtends the arc of C between p and q%, as shown in Figure 6.3. 
Let L 3 be the line tangent to C at q 3 . Let 9 3 = 9(q 3 ,L 3 ). The angle 61 is 
half the length of the two thick arcs in Figure 6.3 whereas the angle 83 is 
half thelength of the thick arc joining p to q 3 . Hence #3 < 9\. But the angle 
#3 decreases as we move ^3 towards p along C . Therefore 82 < 8 3 . Putting 
these two inequalities together, we find that 8 2 < 8\- 4k 
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c 



ql 



q3 



L3 



LI 



Figure 6.3: Comparing two flags. 

Corollary 16.61 now says that D(q 2 ,L 2 ) > D(qi,Li). To finish our proof, 
it remains only to compute the quantity D(q 2 ,L 2 ). 

We know that r 2 = R. Some elementary geometry shows that 



do 



R2 2 j 

— ; x 2 = r - d 2 
Plugging Equation IX 101 into Equation 11051 and simplifying, we get 



D(q 2 ,L 2 ) = ^E'(R) 



D2, 



the bound from Lemma 16 .31 Now we know that 



D(q,L)<-E>(R) + (l- T )E"(R), 



110) 



111) 



(112) 



for all flags (q, L) such that \\p — q\\ > R. But D(q, L) is just another name 
for the quantity d 2 (p/ds 2 featured in Lemma 16.31 This completes the proof 
of Lemma 16. 31 
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7 Parametrizing Arcs and Segments 



7.1 Overview 

Let S C C be a line segment. In this chapter we define a certain parametriza- 
tion of S by a parameter x G [0, 1]. Once we define our parametrization, we 
will state and prove several geometric results about it. As with the last chap- 
ter, the reader might want to just note the results on the first reading and 
then come back later for the proofs. 

Let A be a circular arc on S* 2 , and let A' be the chord that joins the 
endpoints A and A-y of A. Throughout the chapter, we assume that A is 
contained in a semicircle. We let x — > A' x be the afline map from [0, 1] to 
A'. Let C be the circle containing A, and let c G C be the point which is 
diametrically opposed to the midpoint of A. We define A x so that the three 
points c, A' x , A x are always collinear. Here is our first result. 

Lemma 7.1 The function f(x) = \\A X — A' x \\ attatains its maximum at x = 



Now we return to our main task of parametrizing a segment S C C. Let 
S* = S _1 (S'). The method above gives us a parametrization of S*. Now we 
define 



As usual £ denotes stereographic projection. 

Suppose Q is a normal dyadic square. This means that Q does not cross 
the coordinate axes, and the side length of Q is at most 1. We consider the 
case when Q is contained in the positive quadrant. The other cases have 
symmetric treatments. Let Q and Qi be the left and right edges of Q. For 
any x G [0, 1], let Q x denote the segment connecting (Qo) x and (Qi) x . The 
main result in this chapter gives estimates on the size and shape of the image 



1/2. 



S x = Z(S* X ). 



(113) 



of Ql 



s- 1 



(Q x ). 



Lemma 7.2 Q* x has arc length at most 



5 x 1.0013. 



and is contained in a circle of radius at least 

1 
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7.2 Proof of Lemma 17.11 



Our result is scale-invariant. It suffices to prove the result when A is an arc 
of the unit circle, as shown in Figure 7.1. The arc cy is evidently shorter 
than the diameter ex. On the other hand, the arc cz is evidently longer than 
the arc cw. Hence the arc yz is shorter than the arc wx. This is what we 
wanted to prove. 



7.3 A Lemma about Slopes 

The rest of the chapter is devoted to proving Lemma 17.21 Here we reduce 
Lemma [7.21 to a more geometric statement. 

Lemma 7.3 Q x has slope in (0,0.051) for all Q and x G (0, 1). 

Proof of Lemma 17. 2t Let S = Q x . We deal first with the arc length of 
S* . This is really the same argument as in Lemma 13.41 When evaluated at 
all points of S, the quantity in Equation [7] is at most 8/s. Since S has slope 
in (0, 0.051) and Q has side length s, the segment S has length at most 




y 



X 



Figure 7.1: The relevant points 



s x 



V 7 ! + (0.051) 2 < 1.0013. 



(114) 
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The arc length estimate on S* follows by integration. 

Since the line L through S has positive slope, it intersects the imaginary 
axis in a point of the form iy where y < y. But then, according to Equation 
[61 there are two points on (L U oo)* which are at least 

1 

apart. 4k 



The rest of the chapter is devoted to proving Lemma 17.31 



7.4 Mobius Geometry 

Let the vertices of Q be Qoo, Qw, Qu, Qoi, starting in the bottom left corner 
and going counterclockwise around. Associated to Q is an auxilliary map 
: [0, 1] — >■ [0, 1], defined as follows. 

• The point (1 — x)Q 00 + xQ i is (Qq) s for some s. 

• The point (Qi) s is (1 - y)Q w + yQ u for some y = <f)(x). 

In other words, we consider the natural map (Qo) s — > {Qi) s but we precom- 
pose and postcompose with afline maps to make the domain and range equal 
to [0, 1]. Lemma [7.21 is equivalent to the statement that 

< <f>{x) - x < 0.051; Vxe(0,l). (115) 

Given 4 distinct points A, B,C,D G R n , we have the cross ratio 

X (w.g) |g:g;; . m 

We say that a homeomorphism from one curve to another is Mobius if the 
map preserves cross ratios. In case the curves are line segments in the plane, 
a Mobius map between them is the restriction of a linear fractional transfor- 
mation of C U oo. 



Lemma 7.4 is Mobius. 
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Proof: Since similarities are Mobius transformations, it suffices to prove 
that the map (Qq) s — > {Qi) s is a Mobius transformation. Stereographic 
projection is well known to be a Mobius map from any line segment in C to 
the corresponding arc on S 2 . Referring to the construction in the beginning 
of §7.1[ the map from A x to A' x is just the composition of afhne maps with 
(one dimensional) stereographic projection. Hence, this map is also Mobius. 

Let A = Qq and A' be the chord connecting the endpoints of A . Like- 
wise define A\ and A' x . The map of interest to us is the composition 



The outer maps are Mobius, from what we have already said, and the middle 
map is affine. 4k 

A Mobius map from [0, 1] to [0, 1] is completely determined by its deriva- 
tive at either endpoint. Thus, we can understand by computing or esti- 
mating 0'(O). In our next result, we think of <f)'(0) as a function of the choice 
of dyadic square Q. As in Equation [7J define 



Qo ->• A ->• A' ->• A[ A x ->• Qi. 



(117) 



g(x,y) 



2 



(118) 



1 + x 2 + y 2 



Define 



g(Qoo)g(Qn) 
g(Qw)g(Qio)' 



(119) 



Lemma 7.5 Relative to Q, we have 





In particular, is the identity iff Gq 



1. 



Proof: If follows from symmetry that 



</>W(i) = i. 



(120) 



Therefore 




(121) 
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Looking at the composition in Equation I117[ all the maps except the outer 
two have the same derivative at either endpoint. For the affine map in the 
middle, this is obvious: the derivative is constant. In the case of the map 
Aj — > Aj this follows from the fact that we are projecting from a point Cj 
that is symmetrically located with respect to A', and Aj. Call this property 
of the derivatives the symmetry property. 

By Equation the quantity g(Qoo) is the norm of the derivative of 
at Qoo- The other quantities g{Qij) have similar interpretations. It therefore 
follows from the symmetry property and the Chain Rule that 

</>'(o) _ g(Qoo)/g(Qoi) (]22] 
m g{Q w )/g{Qii)' 1 ] 

This Lemma now follows from Equations 11211 and 11221 4k 



7.5 The End of the Proof 

For the purposes of doing calculus, we define (j> an d G relative to any square 
that is contained in the positive quadrant. We remind the reader that we 
only consider squares that have side length at most 1. 



Lemma 7.6 It never happens that Gq = 1. 



Proof: When Q 00 = (x, y) and Q has side length r, we compute 

Q _ (1 + r 2 + 2rx + x 2 + y 2 )(l + r 2 + 2ry + x 2 + y 2 ) 
Q ~ (1 + x 2 + y 2 )(l + 2r 2 + x 2 + y 2 + 2rx + 2ry) ^ ' 

Every factor in Equation 11231 is a polynomial with only constant coefficients 
and at least one constant term. Hence this expression never vanishes. 4k 



When Q = [0, l] 2 , we compute that Gq =4/3. Hence 0'(O) > 1 relative 
to this choice of Q. It follows from continuity that <p'(0) > 1 relative to any 
square in the positive quadrant. But this means that is increasing. (Here, 
of course, we are crucially using the fact that is a Mobius map from [0, 1] 
to [0, 1].) Hence 4>(x) > x. This proves that the slope of the segment Q x is 
positive for all Q and all x > 0. This proves half of Lemma 17.31 Now we 
turn to the other half. 
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Lemma 7.7 The quantity Gq is maximized when Q has side length 1 and 

Proof: Let ip(x, y, r) be the function in Equation 11231 We compute symbol- 
ically that 

d± = A(s,y,r) 

dr (1 + x 2 + y 2 ) 2 (l + 2r 2 + x 2 + y 2 + 2rx + 2ry) 2 ' 1 ' 

Where A is a polynomial with entirely positive terms, at least one of which 
involves only r. (The polynomial is rather long and unenlightening.) From 
this we conclude that difi/dr > 0. Hence, the maximum value of Gq must 
occur when r = 1. 

We compute that 1) = 3/2. So, we just have to show that the 

function h(x,y) = 3/2 — ip(x,y, 1) is non-negative on the positive quadrant. 
We compute that h is a rational function. The denominator is a polynomial 
with only positive terms, and the numerator N equals 

1 - 2x + 4x 2 + 2x 3 + x 4 -2y- 8xy + 2x 2 y + 4y 2 + 2xy 2 + 2x 2 y 2 + 2y 3 + y 4 (125) 

In fact, iV is non- negative on the entire plane. To see this, we make the 
change of variables 

x = i + u- y = i + v. (126) 

With this change of variables, we find that N — (2u — 2v ) 2 is a polynomial 
involving only positive terms. 4* 

In light of the previous result, the quantity <p'(0) is maximized for the 
special square Q from Lemma [7771 Since G = 3/2 in this case, we have 

0'(O) = y^/2. (127) 
But this equation pins down uniquely, and we observe that the map 

^ ~ 2^3 + 3^-2^2;' < ^ 128 ' ) 

has the same derivative. Hence, this is the correct formula for 0. A bit of 
calculus now shows that 

<f>(x) - x < 0.051; VxG[0,l]. (129) 

This completes the proof. 
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8 Proof of Lemma 



5.2 



8.1 The Geometry of Circles 

We need one more result about circles. 

Lemma 8.1 Suppose that A is an arc of a circle C. Let d be the arc-length 
of A. Let A' be the segment connecting the endpoints of A. Let r be the radius 
of C. Let \x be the smallest constant such that every point of A is within ii 
of some point of A' . Then 

d 2 

a < — . 

P 8r 



Proof: Let's first consider the case r = 1. We rotate so that C is the unit 
circle, and A is the arc bounded by the points exp(— id) and exp(i9). Here 
6 e (0,tt). Then 

d = 29; fi=l-cos(0). (130) 
The claim of this lemma boils down to the statement that 

> 2, (131) 



1 - cos(0) 

This is equivalent to the statement that 

(f)(6) = 6 2 + 2 cos(0) - 2 > 0. (132) 

We have 0(0) = and 

(f>'(6) = 2(6 - sin(0) > 0. (133) 

This proves what we need. 

If r 7^ 1, we let T be a dilation that scales distances by a factor of 1/r. 
The arc T(A) has length d/r and T(C) has radius 1. We apply our result to 
the pair (T(A),T(C)) and find that every point of T(A) is within 

(d/r) 2 
8 

of T(C). Applying T _1 , we get the desired result for the pair (A, C). 6 
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8.2 Lemma 15.21 for dyadic segments 

8.2.1 Defining the Weighting 

Suppose that (Q, Q) is a reasonable pair, and Q is a dyadic line segment. 
The construction in the previous chapter gives us a parametrization x — > Q x . 
We take Qq to be the left endpoint and Q\ to be the right endpoint. The 
corresponding endpoints of Q* are Qq and Q*. We define our weighting as 
follows. Letting z = Q x , we define 

X (z) = l-x; \ l (z)=x- (134) 

8.2.2 Setting up the Calculation 



To bring our notation in line with Lemma 16.11 we define 

A = A(Q* ,Ql) = Q*; A' = A'(Q* , Q\). (135) 
Let (z,w) G Q x Q. Define 

p = Y>- l (w)] q = Y>- l (z). (136) 

Setting x = Ai(^), we have 

q = A x . (137) 

We also define 

q' = A' x (138) 

The idea of our proof is to estimate things with q' in place of q, and then to 
estimate the error we get when replacing q 1 by q. 

8.2.3 Using Lemma 16.11 

Let F be as in Equation [98j We have 

Y,Uz)f(Qu,w) = (l-x)F(A' )+xF(A' 1 ); 

u 

F(A' x ) = E(\\q'-p\\). (139) 



Lemma [6. II now tells us that 

J2^(z)f(Qu,w)-E(\\p-q'\\) ^maxCO.Ax) 5 2 (140) 
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8.2.4 The Easy Case 

Now we need to see what happens when we replace q' by q. Define 

r = \\p ~ l\\ r ' = \\p ~ q'W- (141) 

If r' > r then (since E is decreasing) 

E(\\p-q\\)>E(\\p-q'\\). (142) 

In this case, our proof is done: Equations 11421 and 11401 combine to give a 
tighter bound than what Lemma 15.21 gives. 

8.2.5 The Hard Case 

Now suppose r' < r. Since E is convex, E' is monotone decreasing. Therefore 

E(r') - E(r) < E'{r')\\q - q'\\ < E\R)\\q - q\\. (143) 

Here R > r' is as in Lemma 15.21 

The same proof as in Lemma 17.21 shows that A has arc length at most 
5. Also A is contained in a great circle - i.e. a circle of radius 1. Lemma 
18.11 now says that every point of A is within 5 2 /8 of A'. But the point of A' 
closest to A1/2 is A' x , 2 . Therefore 

\\q - = \\A X - A'J <* \\A 1/2 - A' 1/2 \\ < j. (144) 

The starred inequality is Lemma 17.11 Combining Equations 11431 and I144[ we 
find that 

E(r')-E(r)<E'(R)— = A 2 5 2 . (145) 

8 

Note that 

f(z,w)=E(\\p-q\\). (146) 

Hence, by Equation 11461 

E(\\P ~ Q'W) ~ /(^ w) = E(r') - E{r) < A 2 S 2 . (147) 

Adding Equations 11401 and 11471 we get the bound in Lemma 15.21 This com- 
pletes the proof in case Q is a dyadic segment. 
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8.3 The Weighting for Dyadic Squares 

Let Q be a dyadic square and let Q* = Let Q a b be the vertices of 

Q, as in §7.41 Let Q* ab be the corresponding vertex of Q*. As we make our 
construction, the reader should picture the letter 'H', with the horizontal bar 
very slightly slanted. We will use coordinates (h,v) G [0, l] 2 . The v vari- 
able moves along vertical segments and the h variable moves along (roughly) 
horizontal segments. 

Let Qo be the left edge of Q. The two endpoints of Qo are Qoo and Qoi- 
Likewise, let Q\ be the right edge of Q. The two endpoints of Qxq are Qn. 
Using the parametrization from the previous chapter, we define 

Qov = (Q )v e Q ; Qi v = {Qi) v e Q v (148) 

Next, we define Q v to be the segment joining Q 0v to Q iv . Note that Q v is 
nearly horizontal but not exactly horizontal. See Lemma 17.31 Finally, we 
define 

Qhv = (Qv)h, (149) 

again using the parametrization discussed in the previous sections. We define 
our weighting as follows. Assuming that z = Q v h, 

Aoo(z) = (l-h)(l- v); X w (z) = (h)(l - v); 

\ 01 {z) = (l-h){t); \ n {z) = (h)(v). (150) 

In the previous section, we defined a weighting for dyadic segments. We 
could equally well define this weighting for any segment, since we have already 
explained how to parametrize any such segment. There are three ways in 
which our weighting here is compatible with the weighting for segments. 

• With respect to the segment Q , the weighting for the point So v is 
X = 1 — v and Ai = v. 

• With respect to the segment Qi, the weighting for the point S\ v is 
Xq — 1 — v and Ai = v. 

• With respect to the segment Q v , the weighting for the point Qhv is 
Ao = 1 — h and Ai = h. 
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8.4 The End of The Proof 

For ease of notation, we will assume that our dyadic square lies in the positive 
quadrant. The other cases are similar, and indeed follow from symmetry. We 
gather 4 pieces of information. 

1. The circles containing the arcs Qq and Q\ have radius at least 

1 

y/l + x' 

This follows from Equation [6] and from the fact that the line extending 
a vertical edge of Q comes within x of the origin. 

2. The arcs Qq and Q\ have length at most 5. This follows from the same 
argument as in Lemma [7.21 

3. For any v G [0,1], the line extending the segment Q v comes within y 
of the origin. Hence, the circle containing Q* v has radius at least 

1 

See Lemma [7T2l 

4. For any v G [0, 1], the arc Q* v has length at most (1.0013) x 5. See 
Lemma 17.21 

Now we are ready for the main argument. The basic idea is to make 
repeated appeals to the segment case of Lemma 15.21 and then to suitably 
average the result. 

We define 

A lx = A ly = Ax/2; 
A 2 , = -^V / TT^; A 2y = -^§V^¥- (151) 

We have 

Ai = A lx + A ly ; A 2 = A 2x + A 2y . (152) 

Let p G (Q)* be some point. Let w = Let / be as in Lemma [5.21 

That is 

f(z,w) = E(\\J:-\z)-J:-\w)\\). (153) 
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Applying the segment case of Lemma [5.21 to the arc Qq, we find that 

Z x := (l-v)f(Q 00 ,w) + vf(Q 01 ,w)-f(Q 0v ,w) < (maxA lx + A 2x y 2 (154) 

The proof is exactly the same as in the previous section, except for the one 
point that the circular arc Qq lies not necessarily in a great circle but rather 
a circle whose radius is bounded by Item 1 above. The designation Z\ is for 
algebraic purposes which will become clear momentarily. Similarly 

Z 2 := {l-v)f{Q w ,w)+vf{Q u ,w)-f{Q lv ,w) < (maxA 1:c + A 2x )5 2 (155) 

Finally, an argument just like the one given for dyadic segments also works 
for the segment Q s . The only property we used about dyadic segments is 
that they don't cross the coordinate axes, and Q s has this property. Using 
Items 3 and 4 above in place of Items 1 and 2, the same argument gives 

Z 3 := (l-h)f(Q 0v ,w)+hf(Qi v ,w)-f(Q hv ,w) < (max A ly + A 2y ^5 2 (156) 
The key point here is that 

7.98 < 8/1.0013. 

The number 1.0013 comes up in Item 4 above. 

Concentrating on the left hand sides of Equations 1154} 1155} and 1156} we 
have 

(1 - h)Z x + hZ 2 + Z 3 = 
(1 - h)(l - v)f(Q 00 ,w) + (1 - h)(v)f(Q 01 ,w)+ 
(h)(l - v)f(Q 10 ,w) + {h){v)f(Q n ,w) - f(Q hv ,w) = 

(^^Kb{z)f{Qab,w) s j-f(z,w)] z = Q hv . (157) 

ab 

Concentrating on the right hand sides, we see that the quantity in Equation 
11571 is at most 

(1 - h) ( max Ai a + A 2x ) 5 2 + (h) ( max A lx + A 2x ^5 2 + ( max A ly + A 2y ) 5 2 = 

(max(A 1 ,0) + A^S 2 . (158) 

This proves Lemma [5.21 
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9 The Hessian and its Variation 



9.1 Main Part of the Proof 

In this chapter we prove Lemma I2~4l Let H e denote the Hessian of S, the en- 
ergy function, relative to the function E(r) = r~ e . Let Z be the configuration 
corresponding to the TBP, normalized as in Equation | 



Lemma 9.1 The lowest eigenvalue of H e exceeds 1/10 for e = 1,2. 

Proof: Let M be either of the two matrices. Let Ij be the 7x7 identity ma- 
trix. Using a modified version of the Cholesky Decomposition, as discussed 
in [Wa, p 84], we write 

(M - -L/ 7 ) = LDLK (159) 

where L is lower triangular, D is diagonal matrix with all diagonal entries 
positive, and V is the transpose of L. This suffices to show that M — -^I 7 is 
positive definite. Hence, the lowest eigenvalue of M is at least 1/10. 4jfc 

Given a square matrix M, we define 

iimii 2 = (E a 4) 1/2 - ( 16 °) 

We mention a familiar and useful property of this norm. 

Lemma 9.2 For any unit vector v G R 7 we have \\M(v) ■ v\\ < ||M||2. 

Proof: We have 

1/2 



\\M(v)-v\\ 2 =\\J2M ij v l v J \\ <* ||M|| 2 



ViVjf 



\\Mh\j(Y. v ^){X v l) = l|M|| 2 |kl| 2 = ||M|| 2 . (161) 
The starred inequality is the Cauchy-Schwarz inequality. 4ft 

We write M' -< M if |M^| < M {j for all indices. Recall that s = 2~ n , as 
in Lemmas 12.31 and 12.41 Let Dk denote the partial derivative with respect to 
the kth direction in R 7 . 
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Lemma 9.3 (Variation Bound) Let^i^, be the smallest non-negative 
matrices such that DkH e (Z) -< ^k,e f or 0,11 k and all Z G Q s . Let 



*(e) = ||£> 



k.e 



k=l 



Then < 345 and tf(2) < 140 and, sup e ^(e) < 463. 

Corollary 9.4 Let Z be the center ofCl s (i.e., the TBP) and let W E Vt s be 
any other point. Then 



\H e (Z) - H e (W)\\ 2 < ± 



Ve G (0,oo). 



Proof: Let H = H e . Say that a special path in Q is a 7-segment polygonal 
path 7 such that the ith segment is parallel to the ith coordinate direction. 
We can connect Z = Z to W = Z 7 by a special path 7, all of whose 
segments have length at most 2 -12 . Let Zq, Z 7 be the vertices of 7. Let 
Aij = H(Zi) — H(Zj). Integrating along the fcth segment, we get the bound 
A fc x -< 2- 12 ^ k . Hence 



\H(Z)-H(W)\\ 2 =\\A 70 \\ 2 = \\j2&k. 



fc-i 



fc=i 



< 



)12 



< 



345 



This last quantity is less than 1/10. 4k 



4096 
(162) 



Proof of Lemma 12.41 ; Let e = 1 or e = 2. Let H = H e . Let W G f2 s be 
some point. We want to show that H(W) is positive definite. Let A = H(Z) 
and B = H(W). By the preceding Corollary, we have B = A + A with 
||A|| 2 < 1/10. 

Now let v G -R 7 be any unit vector. We have 



Bv ■ v = Av ■ v + Av ■ v > Av ■ v - \Av ■ v\ > — - ||A|| 2 > 0. (163) 

Hence B is positive definite. The fact that Av -v > 1/10 comes from the fact 
that the lowest eigenvalue of the symmetric matrix A is greater than 1/10. 4t 
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9.2 A Stronger Bound 

Consider the function 

4>{x) = x e/2 (164) 

and the interval 



i+i + 2- 
4 2 



(165) 



Define 

d k (f) 

c fc (e) = sup— (x) (166) 

Setting Cfc = Cfc(e), we define 

T(e) = ^/l9336cf + 19036c!C 2 + 4922c| + 1474 Cl c 3 + 772c 2 c 3 + 31c§. (167) 

Our next result uses the notation from the Variation Bound. 
Lemma 9.5 (Variation Bound II) ^(e) < T(e) for all e G (0, oo). 
We find easily that 

(c 1 (l),c 2 (l),c 3 (l)) = (1,2,12). (168) 
Plugging this into Equation I167[ we get ^(1) < 345. Similarly, we have 

(ci(2),c 2 (2),c 3 (2)) = (1,0,0). (169) 

This yields (2) < 140. 

Some elementary calculus shows that 

sup Ci(e) < 1.1; supc 2 (e) < 3.2; sup c 3 (e) < 16. (170) 

e€(0,oo) e eG(0,oo) 

We omit the details, because we don't use the general bounds anywhere in 
our main proof. When we plug in these bounds we find that \P(e) < 463 for 
all e. Hence, the Variation Bound II implies the Variation Bound. 

Remark: The Variation Bound II is generally much better than the Varia- 
tion Bound. For instance, T(e) — > as e — > or as e — > oo. 
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9.3 Proof of the Variation Lemma II 



Our proof usually suppresses the dependence on the exponent e. 

A point (zq, Zi, z 2 , Zs) G Q s is such that each z m lies within a square 
A m+ i of side length 2~ u about one of the points of the TBP configuration, 
normalized as in Equation 1301 Let Ri be the union of these 4 squares, 
Ai,...,A 4 . Let R 2 C C 2 denote the set of points Zi,z 2 which arise as a 
disjoint pair of finite points of a configuration of Q s . Here R 2 consists of 
12 components, all of the form A,; x Aj. The components A« x Aj and 
Aj x Aj are partners. We let A5, A10 be 6 components, no two of which 
are partners. 

Recall that S _1 is inverse stereographic projection. See Equation Set- 
ting z = x + iy, define 

w x 1 l + x 2 + y 2 

{z) = FHiJ^RF = — i — • (171) 



C(z z) 1 (l + xj + y!)(l + x 2 2 + y 2 ) 

{h2) ||S-i(^i)-S-i(^)|| 2 4(x 1 -x 2 ) 2 + ( 2/1 - 2/2 ) 2 • 1 ] 

Let (j) be as in Equation 11641 According as m < 4 or m > 5, define 

U m = (f)oU m] U m = F or G (173) 

Define 

$(i,j,k,m) = sup \DiDjD k U m \; §(i,j,k,m) = sup | DiDjD k \J m \. (174) 

Here i, j, k G {1, 7} and m G {1, 10} and D k is the fcth partial deriva- 
tive. 

We have 

10 

8(z , z 1: z 2 , z 3 ) = U m , (175) 

m=l 

where the arguments of the functions on the right hand side are suitably 
chosen sub-lists of (z , zi, z 2 , z 3 ). Therefore 

10 

|£> fc A£;(£)l < Mi, 3) :=Y,^^3^,m). (176) 

m=l 
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With this definition of we have D^H -< Summing over k, we have 



X>*|| ^ ||^$(^J,fc,m) . (177) 



fc=l fc,m 



Say that i G {1, 7} is related to an index m G {1, 10} if a change 
in the coordinate Xj moves one of the points in the argument of U m . For 
instance i = 2 is related to m = 5 because changing x 2 changes the location 
of the point Z\ = X\ + 2x2, and the argument of t/5 is the point (zq, z\) G A 5 . 
Clearly j, = unless z, j, are all related to m. 

When all indices are related to m, we use the chain rule 

DiDjDkUm = <p' x DiDjDkUm + <\>" x DiU m x ^f/ m x L> fc £/ m + 

0" x DiDjU m x D k U m +(f>" x DjDfct/^ x DiU m +4>" x D k DiU m x DjU m . (178) 

Given the definitions of the regions i?i and -R 2 we have U m (A m ) C J, 
where J is the interval from Equation ll65l Combining this fact with Equation 
11661 and Equation 11781 we have 

$>(i,j, k, m) < Ci$(i,j, k, m) + c^(i, m)$(j, m)$>(k, m) + 
c 2 $(z, j, m)$(k,m) + c 2 $(j, fc,ra)$(z,m) + c 2 &(k,i,m)§(j,m). (179) 
Let <5 m be the center of A m . Define 

a(ij,k,m) = iD^DkUUSm)]- (180) 
We define a(i,j;m) and a(i;m) similarly. Define 

v = ((v(i),v(i,j),v(i,j,k)) = (-^—,0,o) or (—,—,—), (181) 
vv ui v v VlOOO / V200' 50' 10/ V ; 

according as m < 4 or m > 5. Let b(i; m) = a(i; m) + v(i), etc. 
Lemma 9.6 for all (i,j,k,m) we have 

m) < b(i, m); j, m) < j, m); $(i,j,k,m)<b(i,j,k,m). 
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Combining Equation I177[ Equation I179[ and Lemma 19. 6j we have 



( c 1 b(i,j,k,m)+ \ 
c 2 (6(i, j,m)6(fc,m)+ 
in ) c 2 (b(j, k,m)b(i,m)+ (182) 
c 2 (b(k,i,m)b(j,m)+ 

\c 3 b(i,m)b(j,m)b(k,m) J 

Here 5(i,j, k, m) = 1 if k are all related to m, and otherwise 0. When we 
compute the norm of the right hand side in Mathematica, we get the square 
root of a polynomial in 01,02,03. Rounding the coefficients of this polynomial 
up to integers, we get T. This completes the proof of the Variation Bound 
II, modulo the proof of Lemma 19.61 



9.4 Proof of Lemma 19.61 

When m < 4, we are dealing with the function F from Equation 11711 We 
have DiF = Xi/2 and all higher derivatives are constant. Lemma [9.61 in this 
trivial case now follows from the fact that radius(A m ) < 1/500. 

For the nontrivial cases, we fix some value of m G {5, 10} and consider 
the cube A = A m C C 2 . This cube has side-length 2~ n . The center point 
is 5 = 8 m . Define 

a k = \D k G(S)\; $ fe = max \D k G{ Zl , z 2 )\. (183) 

(zi,z 2 )eA 

The maxima here are taken over all kth partial derivatives D k - An exact, 
finite calculation, done for each of the 6 choices of m, yields 

a x < 1; a 2 < 4; a 3 < 18; a 4 < 96; a 5 < 600. (184) 

Now we give an estimate on $ 6 . Given a polynomial P G R[xi, yi,x 2 , 1/2], 
let \P\ denote the sum of the absolute values of the coefficients of P. We 
have the easy upper bound 

1^0*1,32)1 < |P|max(|^|,|^ 2 |) d ; d = degree (P). (185) 
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Here we have set Zj = Xj + iyj. Calculating symbolically we find that each 
6th partial derivative Dq has the following structure. There is a polynomial 
P, depending on the choice of derivative, such that 

P 

D 6 G = T — ; \P\ < 4519440; deg(P) < 10. (186) 

\\ z l ~ z 2\\ 

We (easily) have 

max(||*i||, \\z 2 \\, \\zt - z 2 \\~ 1 ) < 1 + 2" 8 ; V( y z 1 ,z 2 )eA. (187) 

Hence 

$ 6 < 4519440 x (1 + 2~ 8 ) 17 < 5000000. (188) 
Lemma 9.7 $ fc < a k + x 

Proof: Let {w^w^) be a point of A. We can connect (wi,W2) to a point 
(zi,z 2 ) G S 2 by a 4-segment polygonal path 7 such that the ith segment has 
length at most 2~ 12 and moves in the ith coordinate direction. This lemma 
now follows from integration. 4jk 

Now we apply Lemma 19.71 in an iterative way. 

$ 5 < 600 + 5000000 x 2~ w < 5483 (189) 

$ 4 < 96 + 5483 x 2~ 10 < 102. (190) 

|$ 3 -a 3 | < 102 x 2~ 10 < — ; $ 3 < 18.1 (191) 

|$ 2 - a 2 \ < 18.1 x 2" 10 < — ; $ 2 < 4.02. (192) 

50 

|$i -aJ < 4.02 x 2- 10 < — . (193) 

200 v ' 

The R.H.S. of Equation 11911 comes from the L.H.S. and Equation 11841 Like- 
wise, the R.H.S. of Equation 11921 comes from the L.H.S. and Equation 11841 
This completes the proof of Lemma 19.61 
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10 Computational Issues 



10.1 General Remarks 

Our calculations in this paper are of two kinds. The material in £|9] makes 
some exact calculations in Mathematica [W] and the rest of the paper makes 
calculations in Java. For the Mathematica calculations, we need to take 
derivatives of rational functions or their square roots, and evaluate them 
at elements of Q[v3]. We also need to simplify and group terms of some 
polynomials. Everything is manipulated exactly. The user who downloads 
our Java program will also find our Mathematica files in the same directory. 

The bulk of the calculations are done in Java, and these are what we 
discuss below. We take the IEEE- 754 standards for binary floating point 
arithmetic [I] as our reference for the Java calculations. This 1985 document 
has recently been superceded by a 2008 publication [12]. We will stick to 
the 1985 publication for three reasons. First, the 1985 version is shorter and 
simpler. Second, the portion relevant to our computation has not changed in 
any significant way. Third, we think that some of the computers running our 
code will have been manufactured between 1985 and 2008, thus conforming 
to the older standard. 

10.2 Doubles 

With a view towards explaining interval arithmetic, we first describe the way 
that Java represents real numbers. Our Java code represents real numbers 
by doubles, in a way that is an insignificant modification of the scheme dis- 
cussed in [I, §3.2.2]. To see what our program does, read the documentation 
for the longBitsToDouble method in the Double class, on the website 
http:/ /java.sun.j2se/1.4.2.docs/api. According to this documentation - 
and experiments verify that it works this way on our computer - our program 
represents a double by a 64 bit binary string, where 

• The first bit is called s. 

• The next 11 bits are a binary expansion of an integer e. 

• If e = 0, the last 52 bits are the binary expansion of an integer m. 

• If e 7^ 0, the last 52 bits are binary expansion of the number m — 2 52 . 
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The real number represented by the double is 



(-l) s x 2 e ~ 1075 x m. (194) 
Example: The double representing —317 is stored as the 64 bit string 

1/10000000111/001111010 . . .0. 
The slashes are put in to emphasize the breaks. Here s = 1 and (since e ^ 0) 

e = 2 10 + 8 + 4 + 1 = 1031. 
There are 44 zeros at the end of the word, and 

m = 2 44 (0 + + 32 + 16 + 8 + 4 + + 1) + 2 52 = 2 44 (317). 

Hence 

(_1)« x 2 e - 1075 x m = -317. 

Now we come to the main point of our discussion above. The non-negative 
doubles have a lexicographic ordering, and this ordering coincides with the 
usual ordering of the real numbers they represent. The lexicographic ordering 
for the non-positive doubles is the reverse of the usual ordering of the real 
numbers they represent. To increment x + of a positive double x is the very 
next double in the ordering. This amounts to treating the last 63 bits of 
the string as an integer (written in binary) and adding 1 to it. With this 
interpretation, we have x + — x + 1. We also have the decrement x_ = x — 1. 
Similar operations are defined on the non-positive doubles. These operations 
are not defined on the largest and smallest doubles, but our program never 
encounters (or comes anywhere near) these. 

10.3 The Basic Operations 

Let D be the set of all doubles. Let 

R = {x G R\ \x\ < 2 30 } (195) 

Our choice of 2 30 is an arbitrary but convenient cutoff. Let D denote the 
set of doubles representing reals in Rq. 

According to the discussion in [I, 3.2.2, 4.1, 5.6], there is a map Rq — > D 
which maps each x G Rq to some [x] G D which is closest to x. In case 
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there are several equally close choices, the computer chooses one according 
to the method in [I, §4.1]. This "nearest point projection" exists on a subset 
of -R that is much larger than Rq, but we only need to consider Rq. We also 
have the inclusion r : Dq — > Rq, which maps a double to the real that it 
represents. 

Our calculations use the 5 functions 

+; -; x; - (196) 

These operations act on Rq in the usual way. Operations with the same name 
act on D . Regarding the first 5 basic operations, [I, §5] states that each of 
the operations shall be performed as if it first produced an intermediate result 
correct to infinite precision and with unbounded range, and then coerced this 
intermediate result to fit into the destination's format. Thus, for doubles x 
and y. 

V r (\ x \) ; x * y = [r(x) * r(y)]\ * G {+, -, x, -=-}. (197) 



x 

The operations on the left hand side represent operations on doubles and the 
operations on the right hand side represent operations on reals. 

Remark: Exceptions to Equation [197] can arise if we divide by or a number 
too close to zero. This will produce an overflow error - too large a number 
to be accurately represented by a double. To see that this never happens, we 
check that we never divide by a number smaller than 2 -11 . Moreover, as we 
discuss at the end of this chapter, we never take the square root of a number 
less than 2~ 5 . 



10.4 Interval Arithmetic 

An interval is a pair / = (x, y) of doubles with x < y. Say that I bounds 
z e Rq if x < [z] < y. This is true if and only if x < z < y. Define 

[x,y] =[x.,y + ]. (198) 

This operationis well defined for doubles in D . We are essentially rounding 
out the endpoints of the interval. Let Iq and I\ denote the left and right 
endpoints of /. Letting / and J be intervals, we define 

/ * J = (min Jj * Ij, max/j * /,-)„; = (v^/o, a/7i) (199) 

63 



That is, we perform the operations on all the endpoints, order the results, 
and then round outward. Given Equation 11971 we have the following facts. 

1. If J bounds x and J bounds y then I * J bounds x * y. 

2. If / > bounds x > then a/7 bounds y/x. 

After we have defined intervals and their basic operations, we define other 
Java objects based on intervals. Namely, interval versions of complex num- 
bers, vectors in R 3 , dyadic segments and squares, and dyadic boxes. For 
instance, the interval version of a complex number is an object of the form 
X + iY, where X and Y are intervals. The algebra of these interval objects 
- e.g. complex addition or the dot product - is formally the same as the 
corresponding algebra on the usual objects. At every step of the calculation, 
the real version of the object is bounded, component by component, by the 
interval version. If some particular interval object passes our test, it means 
that all the real objects bounded by it also pass. 

10.5 Implementation Details 

We implement the interval arithmetic in a way that tries to minimize the 
time we spend using it. We run the floating point algorithm until we notice 
that a box has passed one of the floating point tests. Then, we re-perform 
the interval arithmetic test on the interval arithmetic version of the box. 

• If the box passes the interval arithmetic test, we eliminate it, and switch 
back to the floating point algorithm. 

• If the box fails the interval arithmetic test, we just act as if it failed 
the corresponding floating point test, and resume the algorithm. We 
call this case a mismatch. It is harmless. 

Even though the mismatches are harmless, we prefer to have few mis- 
matches, so that run time for the interval arithmetic version of the code is 
easier to predict from the run time of the floating point version. There are 
4 kinds of potential mismatches, corresponding to the 4 kinds of tests we 
perform, as discussed in §2.51 The two mismatches that actually arise (or, 
rather arose) in abundance are the Energy Estimator mismatches and the 
Redundancy Eliminator mismatches. Once we make our modifications, the 
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code runs completely without mismatches for the 1/r potential and with only 
a 4 mismatches for the 1/r 2 potential. 

To prevent the Energy Eliminator mismatches, we arrange the floating 
point eliminator so that a dyadic box passes only if the minimum energy is 
above Eq + 2~ A0 , where Eq is the energy of the T.B.P. In this way, the interval 
versions of the boxes that are actually checked have a bit of a cushion. The 
reason why this fudge factors does not cause our code to halt is that the 
inequality S(X) - E < 2" 40 only occurs well inside the L°° neighborhood of 
size 2" 14 about the TBP. 

Adding the fudge factor of 2~ 40 makes the floating point test harder to 
pass, but does not change the validity of the program. Logically, the float- 
ing point calculation simply finds a candidate partition of the configuration 
space, in which each box in the partition passes one of our tests. Adding the 
fudge factor does change the final partition a bit, but it doesn't destroy the 
basic property of the partition. 

Mismatches occur for the Redundancy Eliminator because we sometimes 
eliminate configurations where there is an exact equality of coordinates. This 
equality will fail for the corresponding intervals, because of a tiny overlap. 
To get around this problem, we associate to each dyadic object a Gaussian 
integer that represents 2 25 z, where z is the center of the object. When we 
subdivide a dyadic object, we perform the arithmetic on the Gaussian integer 
exactly, so as to compute the exact value of the centers of the subdivided 
objects. We never subdivide more than 24 times - and in fact the maximum 
is about 17 - so we never arrive at a situation where 2 25 z is not an integer. 
When it comes time to compare the various coordinates of a dyadic object, 
we actually compare 2 25 times those coordinates, so that we are working en- 
tirely with integers. This lets us make exact comparisons. 

There is one more fine point we would like to mention. We want to avoid 
taking expressions of the form \/7, where / is an interval that is too close to 
0. If G I, then J < and v^o 

causes an arithmetic error. The only time 
this issue comes up is in the bounds from Lemma 13.11 The quantity 4 — D 2 
might be close to if D is close to 2. To avoid this problem, we define a new 
function a, which has the property a(I) = 2 -5 if I\ < 2~ 10 and otherwise 
a(I) = yl. (It never happens that I\ > 2 -10 and Iq < 0.) When computing 
the interval version of the bound in Lemma I3.1[ we use a in place of ^J~. 
This causes no problem with the proof, because the bound is not as strong 
when we use a in place of y /~. 
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